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Texas Instruments 
(TI) reserves 
the right to make changes 
to or 
to discontinue 
any semiconductor 
product 
or service 
identified 
in this 
publication 
without 
notice. 
TI advises 
its customers 
to 
obtain 
the latest 
version 
of the 
relevant 
information 
to verify, 
before 
placing 
orders, 
that 
the information 
being relied upon 
is 


current. 


TI warrants 
performance 
of its semiconductor 
products 
to current 
specifications 
in accordance 
with TI's standard 
warranty. 
Testing 
and other quality 
control 
techniques 
are utilized 
to the extent 
TI 
deems necessary 
to support 
this warranty. 
Unless mandated 
by 
government 
requirements, 
specific 
testing 
of all parameters 
of 


each device 
is not necessarily 
performed. 


TI assumes 
no liability 
for TI applications 
assistance, 
customer 
product 
design, software 
performance, 
or infringement 
of patents 
or services described 
h.erein. Nor does TI warrant 
or represent that 
any license, either express or implied, 
is granted 
under any patent 
right, 
copyright, 
mask work 
right, 
or other 
intellectual 
property 
right of TI covering 
or relating 
to any combination, 
machine, 
or 
process in which 
such semiconductor 
products 
or services 
might 
be or are used. 


Texas Instruments 
products 
are not intended for use in life-support 
appliances, 
devices 
or systems. 
Use of 
a TI product 
in such 
applications 
without 
the written 
consent 
of the appropriate 
TI 


officer 
is prohibited. 
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In this 
manual, 
Texas 
Instruments 
presents 
technical 
information 
on the 
TI 
SN74ACT8800 
family 
of 
32-bit 
processor 
"building 
block" 
circuits. 
The 
SN74ACT8800 family is composed of single-chipVLSI processor functions, all of which 
are designed for high-complexity 
processing applications. 


This manual includes specifications and operational information on the following high- 
performance advanced-CMOS devices: 


SN74ACT8818A 
SN74ACT8832A 
SN74ACT8836A 
SN74ACT8841A 
SN74ACT8847 
SN74ACT8867 


16-bit microsequencer 
32-bit registered ALU 
32- x 32-bit parallel multiplier/accumulator 
Digital crossbar switch 
64-bit floating-point/integer 
unit 
32-bit vector processor unit 


These high-speed devices operate at or above 20 MHz, while providing the low power 
consumption of TI's advanced 1-Jlm 
EPIC· CMOS technology. 
The EPIC· CMOS 
process combines twin-well structures for increased density with 
1-Jlm gate lengths 
for increased speed. 


The SN74ACT8800 
Fami/y Data Manual contains design and specification data for 
all five devices previously listed and includes additional programming and operational 
information for the' ACT8818A, 'ACT8832A, 
and' ACT884 7. Two application notes, 
"Chebyshev Routines for the SN74ACT884 7" and "High-speed Vector Math and 3-D 
Graphics Using the SN74ACT8847 
Floating Point Unit" 
are also included. 


Introductory sections of the manual include an overview of the '8800 family and a 
summary of the software tools and design support TI offers for the chip-set. The general 
information 
section 
includes an explanation 
of the 
function 
tables, 
parameter 
measurement information, and typical characteristics related to the products listed 
in this volume. 


Package dimensions are given in the Mechanical Data section of the book in metric 
measurement (and parenthetically in inches). 


Complete technical data for any Texas Instruments semicondutor product is available 
from your nearest TI field salesoffice, local authorized TI distributor, or by calling Texas 
Instruments at 1-800-232-3200. 
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Texas Instruments SN74ACT8800 family of 32-bit processor building blocks has been 
developed to allow the easy, custom design of functionally 
sophisticated, high- 
performance processor systems. The '8800 family is composed of single-chip, VLSI 
devices, each of which represents an element of a CPU. 


The '8800 chip set provides the performance, functionality, and flexibility to fill the 
most demanding processing needs and is structured to reduce system design cost 
and effort. Most of these high-speedprocessorfunctions operateat 25 MHz and above, 
and, at the same time, provide the power savings of Tl's advanced, 1 JLm EPIC'· 
(Enhanced Performance Implanted CMOS) CMOS technology. 


Gearedfor computationally intensive applications, SN74ACT8800 devicesinclude high- 
performance ALUs, multipliers, microsequencers, and both vector and floating-point 
processor units. 


The family's building block approach allows the easy, "pick-and-choose" 
creation of 
customized processor systems, while the devices' high level of integration provides 
cost-effectiveness. 


Designed especially for high-complexity processing, the devices in the '8800 family 
offer a range of functional options. Device features include three-port architecture, 
double-precision accuracy, optional pipelined operation, and built-in fault tolerance. 


Array, digital signal, image, and graphics processing can be optimized with '8800 
devices. Other applications are found in supermini and fault-tolerant computers, and 
I/O and network controllers. 


TI's '8800 32-bit processor building block family comprises the following functions: 


• 
SN74ACT8818A 
16-bit microsequencer 


• 
SN74ACT8832A 
32-bit registered ALU 


• 
SN74ACT8836A 
32- x 32-bit parallel multiplier/accumulator 
• 
SN74ACT8841 A digital crossbar switch 


• 
SN74ACT8847 64-bit floating-point and integer unit 


• 
SN74ACT8867 32-bit vector processor unit 


• 
Bipolar Support Chips 
• 
SN74AS8838 32-bit barrel shifter 


With instruction cycle times of 40 ns or less and the low power consumption of EPIC· 
CMOS, the '8800 
chip set offers an unrivaled speed/power combination. 
Unlike 
traditional microprocessors, which require multiple cycles to perform an operation, 
the' ACT8800 processors typically can complete instructions in a single cycle. 


The' ACT8832A registered ALU and' ACT8818A microsequencer together create a 
powerful 25-MHz CPU. Because instructions can be performed in a single cycle, the 
'ACT8832A!' ACT8818A combination is capable of executing 25 million instructions 
per second (MIPS). 
. 


For math-intensive applications, the' ACT8836A fixed-point multiplier/accumulator 
(MAC), 'ACT884 7 64-bit floating-point and integer unit and' ACT8867 32-bit vector 
processor unit offer unprecedented computational power. 


The exceptional performance of the '8800 family is made possible by TI's EPIC· CMOS 
technology. The EPIC· CMOS process combines twin-well structures for increased 
density with 1-J-tmgate lengths for increased speed. 


The '8800 family is designed with a variety of architectural and functional options 
to provide maximum design flexibility. 
These device features allow the creation of 


"customized" 
solutions with the '8800 chipset. 


A building block approach to processingallows designersto match specializedhardware 
to their specific design needs. The' ACT8818A!' ACT8832A combination forms the 
basis of the system, a high-speed CPU. For applications requiring high-speed integer 
multiplication, the' ACT8836A can be added. To provide the high precision and large 
dynamic range of floating point numbers, the 'ACT884 7 can be employed. The 
'ACT8867 
forms the core processor of a graphics subsystem. For multiprocessing 
applications, the' ACT8841 A can be used to eliminate memory bottlenecks. 


To ensure speed and flexibility, each component of the '8800 family has three data 
ports. Each data port accommodates 32 bits of data, plus four parity bits. This 
architecture eliminates many of the I/O bottlenecks associated with traditional single- 
I/O microprocessors. 


The three-port architecture and functional partitioning of the '8800 chip-set opens 
the door to a variety of parallel processing applications. Placing the math and shifting 
functions in parallel with the ALU permits concurrent processing of data. Additional 
processors can be added when performance needs dictate. 


The 'ACT8800 
building block processors are microprogrammable, so that their 
instruction 
sets can be tailored to a specific application. 
This high degree 
of 
programmability offers greater speed and flexibility than a typical microprocessor and 
ensures the most efficient use of hardware. 


A separatecontrol bus eliminatesthe needfor multiplexing instructions and data, further 
reducing processing bottlenecks. The microcode bus width is determined by the 
designer and the application. 


Another source of design flexibility is provided by the pipelined/flowthrough operation 
option. Pipelining can dramatically reduce the time required to perform iterative or 
sequential calculations. On the other hand, random or nonsequential algorithms require 
low latency operations. The '8800 chipset allows the designerto select the mode (fully 
pipelined, partially pipelined, or nonpipelined) most suited to each design. 


The '8800 family is designed to support applications that require double-precision 
accuracy. Many scientific applications, such as those in the areasof high-end graphics. 
digital signal processing, and array processing, require such accuracy to maintain data 
integrity. In general-purpose computing applications, floating-point processors must 
often support double-precision data formats to maintain compatibility with existing 
software. 


To ensure data integrity, '8800 devices (excluding the microsequencer) support parity 
checking and generation. as well as master/slave error detection. Byte parity checking 
is performed on the input ports. and a parity generator and a master/slave comparator 
are provided at the output. Fault tolerance is built into the processors, ensuring correct 
device operation without extra logic or costly software. 


In a high-performance microcoded system, a fast microcode controller is required to 
control the flow of instructions. The SN74ACT8818A is a high-speed. versatile 16-bit 
microsequencer capable of addressing 64K words 
of microcode memory. 
The 


•ACT8818A can address the next instruction fast enough to support a 30-ns system 
cycle time. 


The' ACT8818A 65-word-deep by 16-bit-wide stack is useful for storing subroutine 
return addresses, top of loop addresses. and loop counts. Addresses can be sourced 
from eight different sources: the three I/O ports, the two register counters, the 
microprogram counter, the stack, and the 16-way branch. 


The SN74ACT8832A 
is a 32-bit registered ALU that operates at 25 MHz. Because 
instructions are performed in a single cycle, the' ACT8832A is capable of executing 
25 million microinstructions per second. An on-board 64-word register file is 36-bits- 
wide (to permit the storage of parity bits). The 3-operand register file increases 
performance by enabling the creation of an instruction and the storage of the previous 
result in a single cycle. To facilitate data transfer, operands stored in the register file 


can be accessed externally, while the ALU is executing. To support the parallel 
processing of data, the' ACT8832A can be configured to operate as four 8-bit ALUs, 
two 16-bit ALUs, or a single 32-bit ALU. The' ACT8832A incorporates 32-bit shifters 
for double-precision shift operations. 


The SN74ACT8836A 
is a 32-bit integer multiplier/accumulator 
(MAC) that accepts 
two 32-bit inputs and computes a 64-bit product. The device can also operate as a 
64-bit by 64-bit multiplier. An onboard adder is provided to add or subtract the product 
or the complement of the product from the accumulator. 


When pipelined internally, the 1-J.tmCMOS parallel MAC performs a full 32- x 32-bit 
multiply/accumulate in a single 30-ns clock cycle. In flowthrough mode (without any 
pipeliningl, 
the 'ACT8836A 
takes 48 ns to multiply 
two 
32-bit 
numbers. The 
'ACT8836A performs a 64- x 64-bit multiply/accumulate, outputting a 64-bit result, 
in 194 ns. 


The' ACT8836A can handle a wide variety of data types, including two's complement, 
signed, and mixed. Division is supported via the Newton-Raphson algorithm. 


The SN74ACT8841 A is a high-performancedigital crossbarswitch that cost-effectively 
eliminates bottlenecks and enables data to speed through complex bus architectures. 


The' ACT8841 A is ideal for multiprocessor applications, where memory bottlenecks 
tend to occur. The device has 64 bidirectional I/O ports that can be configured as 16 
4-bit ports, 8 8-bit ports, or 4 16-bit ports. Each bidirectional port can be connected 
in any conceivable combination. Any single input port can be broadcast to any 
combination of output ports every 19 ns. The transfer time from data in to data out 
is 14 ns. 


The control sources for ten separate switching configurations are on-chip, including 
eight banks of programmable control flip-flops and two hard-wired control circuits. 


The EPICTM 
CMOS SN74ACT8841 A and its predecessor, SN74AS8840, are based on 
the same architecture, differing in power consumption, number of control registers, 
and pin-out. Microcode written for the' AS8840 can be run on the' ACT8841 A. 


The SN74ACT8847 is a high-speed 64-bit floating-point and integer unit. The device 
is 
fully 
compatible 
with 
IEEE standard 
754-1985 
for 
addition, 
subtraction, 


multiplication, 
division, 
square root, 
and comparison. 
Division and square-root 
operations are implemented via hardwired control. 


The' ACT884 7 also performs integer arithmetic, logical operations, and logical shifts. 
Registers are provided at the inputs, outputs, and inside the ALU and multiplier to 
support multilevel 
pipelining. These registers can be bypassed for nonpipelined 
operations. 


When fully pipelined,the' ACT884 7 can perform single-precisionfloating-point or 32-bit 
integer operation in 30 ns for 66 MFLOPS(million floating-point operations per second) 
sustained. 


DEVICE 
PACKAGE 
CLOCK PERIOD 


SN74ACT8847-30 
207-pin PGA 
30 ns 


SN74ACT8847-40 
207-pin PGA 
40 ns 


The SN74ACT8867 Vector Processor Unit (VPU)is a single-precision (32-bit) floating- 
point and integer processor optimized for graphics and signal processing. The 
32-MFLOPS device 
is designed to accept either Digital Equipment Corporation's (DEC) 
floating 
point 'F' format, 
2s complement integers, or 2s complement fractional 
numbers. The output is either DEC 'F' format, integer format, or a variety 
of possible 
fractional numbers with varying radix point placement. All fixed point operands are 
in 2s complement notation. 


The instruction set for the' ACT8867 is partitioned so that the two processing units, 
the ALU processor and the multiplier processor, can be operated independently or in 
parallel. 


The' ACT8867 has two 32-bit input data buses and a 32-bit output data bus. The 
device 
supports 40-bit floating point internally and also supports 2s complement integer 
operations. Forty-six general-purpose registers allow intermediate results to be stored 
internally. 
Two 
other 
pseudo-register 
locations, 
when 
addressed, 
serve 
as a 
flowthrough 
function in the register file. In this mode, the register file is bypassed, 
allowing data to be fed directly from an external source to the processing units. 


Single-precision floating point Newton-Raphson division and square root are supported, 
along with 
limited support for double-precision arithmetic. 
Appropriate 
status is 
generated for all instructions and data types. The VPU can be operated in flowthrough 
mode, or pipeline registers are availableto allow operation in pipelined mode to increase 
system speed. All pipeline registers may be placed in a scan mode for device-level 
and board-level diagnostics. 


Bipolar Support Chips 


The SN74AS8838 high-speed, 32-bit barrel shifter can shift up to 32 bits in a single 
instruction cycle of under 25 ns. Five basic shifts can be programmed: circular left, 
circular right, logical left, logical right, and arithmetic right. The' AS8838 offloads the 
responsibility for shifting operations from the ALU, which increasesshifter functionality 
and system throughput. 


These symbols, terms, and definitions are in accordance with those currently agreed 
upon by the JEDECCouncil of the Electronic Industries Association (EIA) for use in 
the USA and by the International Electrotechnical Commission (IEC)for international 
use. 


Operating Conditions and Characteristics (in Sequence by Letter 
Symbols) 


fmax Maximum clock frequency 
The highest rate at which the clock input of a bistable circuit can be driven 
through its required sequence while maintaining stable transitions of logic level 
at the output with input conditions established that should cause changes of 
output logic level in accordance with the specification. 


ICC 
Supply current 
The current into t the VCC supply terminal of an integrated circuit. 


ICCO Supply current, quiescent 
The current into t the VCC supply terminal of an integrated circuit when all (or 
a specified number) of the outputs do not change and the internal logic is in 
an idle state. 


IIH 
High-level input current 
The current into t an input when a high-level voltage is applied to that input. 


IlL 
Low-level input current 
The current into t an input when a low-level voltage is applied to that input. 


10H 
High-level output current 
The current into t an output with input conditions applied that, according to 
the product specification, will establish a high level at the output. 


10L 
Low-level output current 
The current into t an output with input conditions applied that, according to 
the product specification, will establish a low level at the output. 


10S' 
Short-circuit output current 
The current into t an output when that output is short-circuited to ground (or 
other specified potential) with input conditions applied to establish the output 
logic level farthest from ground potential (or other specified potential). 


10Z 
Off-state (high-impedance-state)output current (of a three-state output) 
The current flowing into t an output having three-state capability with input 
conditions established that, according to the production specification, 
will 
establish the high-impedance state at the output. 


ta 
Access time 
The time interval between the application of a specified input pulse and the 
availability of valid signals at an output. 


tdis 
Disable time (of a three-state output) 
The time interval between the specified referencepoints on the input and output 
voltage waveforms, with the three-state output changing from either of the 
defined active levels (high or low) to a high-impedance (off) state. (tdis = tpHZ 
or tpLZ). 


ten 
Enable time (of a three-state output) 
The time interval between the specified referencepoints on the input and output 
voltage waveforms, with the three-state output changingfrom a high-impedance 
(off) state to either of the defined active levels (highor low). (ten=tpZH or tpzLl. 


tf 
Fall time 
The time interval between two referencepoints (90% and 10% unlessotherwise 
specified) on a waveform that is changing from the defined high level to the 
defined low level. 


th 
Hold time 
The time interval during which a signal is retained at a specified input terminal 
after an active transition occurs at another specified input terminal. 
Notes: 
1. The hold time is the actual time interval between two signal events and is 
determined by the system in which the digital circuit operates. A minimum 
value is specified that is the shortest interval for which correct operation 
of the digital circuit is guaranteed. 
2. The hold time may have a negative value in which case the minimum limit 
defines the longest interval (between the releaseof the signal and the active 
transition) for which correct operation of the digital circuit is guaranteed. 


tpd 
Propagation delay time 
The time between the specified referencepoints on the input and output voltage 
waveforms with the output changing from one defined level (high or low) to 
the other defined level. (tpd = tpHL or tPLH). 


tpHL 
Propagation delay time. high-to-Iow level output 
The time between the specified reference points on the input and output voltage 
waveforms with the output changing from the defined high level to the defined 
low level. 


tpHZ 
Disable time (of a three-state output) from high level 
The time interval between the specified reference points on the input and the 
output voltage waveforms with the three-state output changingfrom the defined 
high level to a high-impedance (off) state. 


tpLH 
Propagation delay time. low-to-high-Ievel output 
The time between the specified reference points on the input and output voltage 
waveforms with the output changing from the defined low level to the defined 
high level. 


tpLZ 
Disable time (of a three-state output) from low level 
The time interval between the specified reference points on the input and output 
voltage waveforms with the three-state output changing from the defined low 
level to a high-impedance (off) state. 


tpZH 
Enable time (of a three-state output) to high level 
The time interval between the specified reference points on the input and output 
voltage waveforms with the three-state output changing from a high-impedance 
(off) state to the defined high level. 


tpZL 
Enable time (of a three-state output) to low level 
The time interval between the specified reference points on the input and output 
voltage waveforms with the three-state output changing from a high-impedance 
(off) state to the defined low level. 


tr 
Rise time 
The time interval between two referencepoints (10% and 90% unlessotherwise 
specified) on a waveform that is changing from the defined low level to the 
defined high level. 


tsu 
Setup time 
The time interval between the application of a signal at a specified input terminal 
and a subsequent active transition at another specified input terminal. 
Notes: 
1. The setup time is the actual time interval between two signal events and 
is determined by the system in which the digital circuit operates. A minimum 
value is specified that is the shortest interval for which correct operation 
of the digital circuit is guaranteed. 


2. The setup time may have a negative value in which case the minimum limit 
defines the longest interval (between the active transition andthe application 
of the other signal) for which correct operation of the digital circuit is 
guaranteed. 


tt 
Transition time (general) 
The time interval between two referencepoints (10% and 90% unlessotherwise 
specified) on a waveform that is changing from the defined low level to the 
defined high level (rise time) or from the defined high level to the defined low 
level (fall time). 


tw 
Pulse duration (width) 
The time interval between specified reference points on the leading and trailing 
edges of the pulse waveform. 


VIH 
High-level input voltage 
An input voltage within the more positive (less negative) of the two ranges of 
values used to represent the binary variables. 
Note: 
A minimum is specified that is the least-positive value of high-level input 
voltage for which operation of the logic element within specification limits 
is guaranteed. 


VIL 
Low-level input voltage 
An input voltage level within the less positive (more negative) of the two ranges 
of values used to represent the binary variables. 
Note: 
A maximum is specified that is the most-positive value of low-level input 
voltage for which operation of the logic element within specification limits 
is guaranteed. 


VOH 
High-level output voltage 
The voltage at an output terminal with input conditions applied that, according 
to the product specification, will establish a high level at the output. 


VOL 
Low-level output voltage 
The voltage at an output terminal with input conditions applied that, according 
to the product specification, will establish a low level at the output. 


VT + 
Positive-going threshold level 
The voltage level at a transition-operated input that causes operation of the logic 
element according to specification as the input voltage rises from a level below 
the negative-going threshold voltage, VT _ . 


VT - 
Negative-going threshold level 
The voltage level at atransition-operated input that causesoperation of the logic 
element according to specification as the input voltage falls from a level above 
the positive-going threshold voltage, VT + . 


TIMING 
CLOADt 
'OL 
'OH 
VLOAD 
PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


tpZH 
0 
ten 
tpZL 
50 
8 
-8 
3 


tdis 
tpHZ 
-8 
tpLZ 
50 
8 
1.5 


tnd 
50 
8 
-8 
; 


t CLOAD Includes probes and test fixture capacitance. 


t VLOAD - VOL 
= 50 Q. where VOL = 0.6 V. IOL = 8 mA. 


IOL 


Timing 
~ 


3V 


Input 
(See Note A) 
OV 
th 
3V 


OV 
tr -+! 
j4- 
tf 


VOLTAGE 
WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


Input 
L 1.5V 
~I ~~;-- 
03 
V 
V 
(See Note A) -.Ii 
1\_. 
_ 


tpd ~ 
r.--+ tpd 


1 
! 
~-- 
VOH 


In-Phase 
1 
l'1.5v 
1 
~1.5V 
Output 
I 
---I......... 
I 
VOL 


~ 
~Itpd 
tpd 
r- 
~I 
\L 
!/:':"; 
V 0 H 


-..... 1.5V 
T 1.5V 
_._--~.--- 
VOL 


~3V 


1 
I 
OV 
~tw--+t 


1 
1 
3V 


~OV 


VOLTAGE WAVEFORMS 
PULSE DURATION 


Output 
Control 
(low-level 
enabling) 
I 
I 
I 
tpZL 
-., 


4------ 
I 


~tPLZ-+! 
14- 


I 
I 
I 
:r----f-t-----3V 


1 
-2.5 V 
I 
VOL+10% 
----- 
VOL 
... 
I 


Waveform 
1 
(See Note B) 


---- 
VOH 
VOH-10% 


-1.5V 


----------- 
OV 


NOTES: A. 
Phase relationships between waveforms were chosen arbitrarily. All input pulses are 
supplied by pulse generators having the following 
characteristics: 
PRR = 
1 MHz, 
Zo = 50 n, tr ~ 6 ns, tf ~ 6 ns. 
B. Waveform 1 is for an output with internal conditions such that the output is low except 


when disabled by the output control. 


C. Waveform 2 is for an otuput with internal conditions such that the output is high except 


when disabled by the output control. For tPLZ and tpHZ, VOL and VOH are specified 
values. 
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SN74ACT8818A 
16·81T MICROSEOUENCER 


• 
CLK-to-Y 
= 24 ns Max (tpd) 


• 
Low-Power 
1-f'm EPIC'" CMOS 


• 
Addresses Selected from Eight Different 
Sources 


• 
Performs Multiway 
Branching, 
Conditional 
Subroutine 
Calls, and Nested Loops 


• 
Large 64-Word 
by 16-bit Stack 


• 
Cascadable 


• 
Addresses Up to 64K Locations of 
Microprogram 
Memory 


description 


The SN74ACT8818A 
microsequencer 
is a 10w-pGwer, high-performance 
microsequencer 
implemented 
in 
TI's ,EPIC· Advanced CMOS technology. 
The 16-bit device addresses up to 64K locations of microprogram 
memory and is compatible 
with the SN74AS890 
microsequencer. 


The' ACT8818A 
performs a range of sequencing operations 
in support of TI' s family 
of building block 
devices and special-purpose 
processors such as the SN74ACT8847 
Floating Point Unit (FPUl. 


The' ACT8818A 
microsequencer 
is designed to control execution 
of microcode 
in a microprogrammed 
system. Basic architecture of such a system usually incorporates at least the microsequencer, 
one or more 
processing elements such as the' ACT8847 
FPU or the SN74ACT8832 
registered ALU, microprogram 
memory, 
microinstruction 
register, and status logic to monitor system states and provide status inputs 
to the microsequencer. 


The' ACT8818A 
combines flexibility and high speed in a microsequencer that performs multiway branching, 
conditional 
subroutine 
calls, nested loops, and a variety 
of other microprogrammable 
operations. 
The 
'ACT8818A 
can also be cascaded for providing additional register/counters 
or addressing capability 
for 
more complex microcoded 
control functions. 


In this microsequencer, several sources are available for microprogram address selection. The primary source 
is the 16-bit microprogram 
counter 
(MPCl, although branch addresses may be input on the two 
16-bit 
address buses, ORA and ORB. An address input on the ORA bus can be pushed on the stack for later 
selection. 
Register/counters 
RCA and RCB can store either branch addresses or loop counts as needed, 
either for branch operations 
or for looping on the stack. 


The selection of address source can be based on external status from the device being controlled, 
so that 
three-way 
or multiway 
branching is supported. 
Once selected, the address which is output on the Y bus 
passes to the microprogram 
memory, and the microinstruction 
from the selected location is clocked into 
the pipeline register at the beginning of the next cycle. 


It is also possible to interrupt the' ACT8818A 
by placing the Y output bus in a high-impedance 
state and 
forcing an interrupt vector on the Y bus. External logic is required to place the bus in high impedance and 
load the interrupt vector. The first microinstruction 
of the interrupt handler subroutine can push the address 
from the Interrupt 
Return register on the stack so that proper linkage is preserved for the return from 
subroutine. 


Microinstructions 
for the' ACT8818A 
select the specific operations performed by the Y output multiplexer, 
the register/counters 
RCA and RCB, the stack, and the bidirectional ORA and ORBbuses. Each set of inputs 
is represented as a separate field in the microinstructions, 
which control not only the microsequencer 
but 
also the ALU or other devices in the system. 


The 
3-port 
architecture 
of the' 
ACT8818A 
facilitates 
both 
branch 
addressing 
and register/counter 
operations. Both register/counters 
can be used to hold either loop counts or branch addresses loaded from 
the ORA and ORB buses. Register/counter 
operations are selected by control 
inputs RC2-RtO. 


PRODUCTION DATA documenls 
conlain 
informalion 
current as of publication date. Products conform to 
specifications 
per the terms 
of Texas 
Instruments 
standard 
warranty. 
Production 
processing 
does not 
necessarily 
include testing 
of all parameters. 
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description 
(continued) 


Similarly, 
the 65-word 
by 16-bit stack can save addresses from the ORA bus, the microprogram 
counter 


(MPCl. or the Interrupt Return register. depending on the settings of stack controls 52-SO and related control 
inputs. Flexible instructions 
such as Branch ORA else Branch to Stack else Continue can be coded to take 


advantage 
of the conditional 
branching 
capability 
of the' 
ACT8818A. 


Multiway 
branching 
(16- or 32-way) 
uses the B3-BO inputs to set up a 16-way 
branch address on ORA 


or ORB by concatenating 
B3-BO with 
the upper 
12 bits of the ORA or ORB bus. The resulting 
branch 


addresses 
ORA' (ORA 15-0RA4::B3-BO) 
and ORB' (ORB15-0RB4::B3-BO) 
are selected 
by the Youtput 
multiplexer 
controls MUX2-MUXO. 
A Branch ORB' else Branch ORA' instruction 
can select up to 32 branch 


addresses, 
as determined 
by the settings 
of B3-BO. 


pin description 


The pin descriptions 
and grid assignments 
for the' ACT8818A 
are given on the following 
pages. The grid 
location 
A 1 is marked for indexing 
purposes. 


GC PACKAGE 


(TOP VIEW) 


2 
3 
4 
5 
6 
7 
8 
9 
10 
11 


A • • • • • • • • • • 


B • • • • • • • • • • • 


C • • • 
• • • 
• • • 


0 • • 
• • • 


E • • • 
• • • 


F • • • 
,. • • 


G • • • 
• • • 


H • • 
• • 


J • • • 
• • 
• • • • 


K 
• • • • • • • • • • • 


L 
• • • • • • • • • 
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50 
51 
52 
CC 
5TKWRN/RER 
ZEROUT 


CLK 
MUXO 
MUX1 
MUX2 
BO 
B1 


B2 


B3 
INT 
INC 
COUT 
RE 
GNO 
\fec 
ZEROIN 


FN PACKAGE 


(TOP VIEW) 


0 
NM 
<:t 
LOCC 


uJo 
0 
N M 
<:t 
LO 
CO r--IW 
<X) 
OJ 
~ 0 
« « « « « « « «0«« 
«0 
« « « « «-l 
(/)Z 
cc cc cc cc cc cc cc CC«CCCC 
ccZ 
cc cc cc cc cc W 
Ot:) 0 0 0 0 0 0 0 Occoo 
ot:) 0 00 
0 O(/) 


111098 
7 
6 
5 
4 
3 
2 
184838281807978777675 


74 


73 


72 


71 


70 


69 


68 
67 


66 


65 
64 
63 
62 


61 


60 
59 


58 
57 


56 


55 


32 
54 


33 34 35 36 37 38 394041 
42 43 44 45 46 474849 
50 51 52 53 


VCC 
ORBO 
ORB1 
ORB2 
ORB3 
ORB4 
GNO 
ORB5 
ORB6 
ORB7 
RBOE 
ORB8 
ORB9 
ORB10 
ORB11 
ORB12 
ORB13 
ORB14 
ORB15 
RCO 
RC1 
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PIN 
PIN 
PIN 
PIN 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 


A2 
RC2 
C2 
RCO 
F3 
RBOE 
J10 
Sl 


A3 
Y1 
C3 
GND 
F9 
BO 
J11 
STKWRN/RER 


A4 
Y3 
C5 
GND 
F10 
Bl 
Kl 
DRBO 


A5 
Y5 
C6 
Y7 
Fll 
MUX2 
K2 
SELDR 


A6 
Y6 
C7 
Yl0 
Gl 
DRB6 
K3 
DRA14 


A7 
Y8 
C9 
GND 
G2 
DRB5 
K4 
DRA12 


A8 
Yll 
Cl0 
VCC 
G3 
GND 
K5 
DRA10 


A9 
Y13 
Cl1 
RE 
G9 
CLK 
K6 
DRA7 


Al0 
NC 
Dl 
DRB12 
Gl0 
MUXO 
K7 
DRA5 


Bl 
DRB15 
D2 
DRB13 
Gll 
MUX1 
K8 
DRA3 


B2 
RCl 
D9 
GND 
Hl 
DRB4 
K9 
DRAO 


B3 
YO 
Dl0 
COUT 
H2 
DRB3 
Kl0 
SO 


B4 
Y2 
Dll 
INC 
Hl0 
CC 
Kll 
52 


85 
Y4 
El 
DR89 
Hll 
ZEROUT 
L2 
DRA15 


86 
YOE 
E2 
DR8l0 
Jl 
DRB2 
L3 
DRA13 


B7 
Y9 
E3 
DR8ll 
J2 
DRBl 
L4 
DRAll 


88 
Y12 
E9 
INT 
J3 
VCC 
L5 
DRA9 


89 
Y14 
E10 
B3 
J5 
GND 
L6 
DRA8 
Bl0 
Y15 
Ell 
82 
J6 
RAOE 
L7 
DRA6 


B1l 
ZEROIN 
Fl 
DRB7 
J8 
DRAl 
L8 
DRA4 


Cl 
DRB14 
F2 
DRB8 
J9 
GND 
L9 
DRA2 


Ll0 
OSEL 
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MASTER 
CLOCK 


STACK 
• iISTACK 
CONTROL 
WARNING 
FLAG 


• 
ZERO DETECT 


ORA/ORB INPUT 
Y·BUS OUTPUT 
....• 


MUX SELECT 
ORA OUTPUT 
....• 


ORA OUTPUT 
EN 
ORB OUTPUT 


..... 


MUX SELECT 
INT RT REG 


...•.. 


• 
~ I REGISTER/COUNTERS 
INCREMENTER 
• 
2 
A AND B CONTROLS 
CARRY·OUT 


INCREMENTER 
- 


•...•. 
CONTROL 
-. 
CONDITION 
CODE 
INT RT 
...•.• 
: IBRANCH 
ADDRESS 


MUX CONTROL 


•• 
3 
' 


• 
~ I Y·OUTPUT 
ZERO SET 


• 
2 
MUX CONTROLS 
'I 
r 
- ~ 
0 
...•.. 


< 
) 
~ 


• 
• 
- 
• 
• 
DATA 
...• 
• 
• 
• 
...•.. 
15 
. 
• 
I 
CURRENT 
• - 
ADDRESS 
...••.. 
II~ 


.. 
·•• 
..•.. 


YOE 
RAOE 
RBOE 
RE 
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RC2-RCO 


ZEROIN 


ZEROUT 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8818A 
16·BIT MICRO SEQUENCER 


PIN 


NAME 
FN 
GC 
110 
DESCRIPTION 


NO. 
NO. 


BO 
22 
F9 


Bl 
23 
Fl0 
1 
Input bits for branch addressing 
(see Table 31 


B2 
24 
Ell 


B3 
25 
El0 


CLK 
18 
G9 
System 
clock 


COUT 
28 
Dl0 
0 
Incrementer carry-out. Goes high when an attempt is made to increment microprogram counter beyond 


addressable 
micromemory. 


CC 
15 
Hl0 
1 
Condition 
code 


DRAO 
9 
K9 


DRAl 
8 
J8 


DRA2 
7 
L9 


DRA3 
6 
K8 


DRA4 
5 
L8 


DRA5 
4 
K7 


DRA6 
3 
L7 


DRA7 
2 
K6 
Bidirectional DRA data port. Outputs data from stack or register/counter 
A (RADE = 0) or inputs external 


I/O 
data (RAOE = 1I. 
DRA2 
7 
L9 


DRA9 
83 
L5 


DRA10 
82 
K5 


DRAll 
80 
L4 


DRA12 
79 
K4 


DRA13 
78 
L3 


DRA14 
77 
K3 


DRA15 
76 
L2 


DRBO 
73 
Kl 


DRBl 
72 
J2 


DRB2 
71 
Jl 


DRB3 
70 
H2 


DRB4 
69 
Hl 
Outputs 
data from register/counter 
B (RBOE = 01 or inputs external 
data 
G2 
I/O 
Bidirectional 
DRB data port. 
DRB5 
67 


DRB6 
66 
Gl 


DRB7 
65 
Fl 


DRBB 
63 
F2 


DRB10 
61 
E2 


DRBll 
60 
E3 


DRB12 
59 
Dl 
(RBOE =01 
Bidirectional 
DRB data 
port. 
Outputs 
data 
from 
register/counter 
B 
or inputs 
external 


DRB13 
5B 
D2 
I/O 
data (RBOE = 1). 
DRB14 
57 
Cl 


DRB15 
56 
Bl 


GND 
10 
C3 


GND 
30 
C5 


GND 
33 
C9 


GND 
46 
D9 
Ground pins. All pins must be used. 


GND 
52 
G3 


GND 
68 
J5 


GND 
81 
J9 


INC 
27 
Dll 
I 
Incrementer control Din 
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PIN 


NAME 
FN 
GC 
\/0 
DESCRIPTION 


NO. 
NO. 


INT 
26 
E9 
I 
Selects INT RT register to stack, active low (see Table 31 


MUXO 
9 
G10 


MUX1 
20 
G11 
I 
MUX control for Y output bus (see Table 41 


MUX2 
21 
F11 


OSEL 
11 
L10 
I 
ORA output MUX select. Low selects RCA. high selects stack. 


RAOE 
1 
J6 
I 
ORA output enable, active low 


RBOE 
64 
F3 
I 
ORB output enable, active low 


RCO 
55 
C2 


RC1 
54 
B2 
I 
Controls 
for 
register/counters 
A and 
B 


RC2 
53 
A2 


RE 
29 
C11 
INT RT register enable. active low. A high input holds INT RT register while a low input passes Y to 


INRT register (see Table 31. 


SO 
12 
Kl0 


Sl 
13 
J10 
I 
Stack controls 


S2 
14 
Kl1 


SELDR 
75 
K2 
I 
Selects data source to ORA bus and ORB bus (See Table 31 


STKWRNI 
Stack warning signal flag 
16 
J11 
0 
RER 


VCC 
31 
C10 


VCC 
74 
J3 
Supply voltage (5 VI 


YO 
51 
B3 


Y1 
50 
A3 


Y2 
49 
B4 


Y3 
48 
A4 


Y4 
47 
B5 


Y5 
45 
A5 


Y6 
44 
A6 


Y7 
43 
C6 
1/0 
Bidirectional Y data port 


Y8 
41 
A7 


Y9 
40 
B7 


Y10 
39 
C7 


Yl1 
38 
A8 


Y12 
37 
B8 


Y13 
36 
A9 


Y14 
35 
B9 


Y15 
34 
Bl0 


YOE 
42 
B6 
I 
Y output 
enable, 
active 
low 


ZEROIN 
32 
B11 
I 
Forces internal zero detect high 


ZEROUT 
17 
Hl1 
0 
Outputs register/counter zero detect signal 


design 
support 


TI's 'ACT8818A 
16-bit microsequencer 
is supported 
by a variety 
of tools developed to aid in design 
evaluation and verification. 
These tools will streamline all stages of the design process, from assessing 
the operation and performance of the' ACT8818A to evaluating a total system application. The tools include 
a functional 
model, behavioral model, and microcode development software 
and hardware. Section 9 of 
this manual provides specific information 
on the design tools supporting TI's '8800 
Family. 
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systems expertise 


Texas Instruments Datapath VLSI Products Systems Engineering group is available to help designers analyze 
TI's high-performance 
VLSI products, such as the' ACT8818A 
15-bit microsequencer. 
The group works 
directly with designers to provide ready answers to device-related 
questions and also prepares a variety 
of applications 
documentation. 


The group may be reached in Dallas, at (214) 997-3970. 


architecture 


The' ACT8818A 
microsequencer 
is designed with a 3-port architecture 
similar to the bipolar SN74AS890 
microsequencer. 
The functional 
block diagram shows the architecture 
of the 'ACT8818A. 
The device 
consists 
of the following 
principal functional 
groups: 


1. A 15-bit microprogram 
counter (MPC) consisting 
of a register and incrementer 
which generates 
the next sequential microprogram 
address 


2. 
Two register/counters 
(RCA and RCB) for counting loops and iterations, storing branch addresses, 
or driving external devices 


3. 
A 55-word 
by 15-bit LIFO stack that allows subroutine calls and interrupts 
at the microprogram 
level and is expandable and readable by external hardware 


4. 
An interrupt return register and Y output enable for interrupt processing at the microinstruction 
level 


5. 
A Y output multiplexer by which the next address can be selected from MPC, RCA, RCB, external 
buses DRA and DRB, or the stack. 


'ACT8818A 
control signals are summarized in Table 1. Those signals, which typically 
originate from the 
instruction 
register, are Y output multiplexer 
controls, MUX2-MUXO. 
These select the source of the next 
address; stack operation controls, S2-SO; register/counter operation controls, RC2-RCO;OSEL, which allows 
the stack to be read for diagnostics; 
input MUX select, SELDR; DRA and DRB output enables, RAOE and 
RBOE; and INT, used during the first cycle of interrupt service routines to push the address in the interrupt 
return register address onto the stack. 


Control and data signals that commonly originate from the microinstruction 
and from other hardware sources 
include INC, which determines whether to increment the MPC; DRA and DRB, used to load or read loop 
counters and/or next addresses; and CC, the condition code input. The address being loaded into the MPC 
is not incremented if INC is low, allowing wait states and repeat until flag instructions 
to be implemented. 
If INC originates from status, 
repeat until flag instructions 
are possible. 


The condition code input CC typically 
originates from ALU status to permit test and branch instructions. 
However, 
it must also be asserted under microprogram 
control to implement 
other instructions 
such as 
continue or loop. Therefore, CC will generally be controlled 
by the output of a status multiplexer. 
In this 
case, whether CC is to be forced high, forced low, or taken from ALU status will be determined by a status 
MUX select field in the microinstruction. 
Control signals that may also originate from hardware are B3-BO, which can be used as a 4-bit status input 
to support 
15- and 32-way 
branches, and YOE, which 
allows interrupt 
hardware to force an interrupt 
vector on the microaddress 
bus. 


Status from the' ACT8818A 
is provided by ZEROUT, which is set at the beginning of a cycle in which 
either of the register/counters 
will decrement to zero, and STKWRN/RER, set at the beginning of the cycle 
in which the bottom 
of stack is read or in which the next to last location is written. 
In the latter case, 
STKWRN/RER remains high until the stack pointer is decremented 
from 54 to 53. 
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SIGNAL 
lOGIC lEVEL 


NAME 
HIGH 
lOW 


BOt 
load stack pointer from 7 least significant bits of ORA 
No effect 


Blt 
Selects ORA contents as stack input (takes priority over INT) 
No effect 


CC 
Condition code input. May be microcoded or selected from external 
Condition code input. For branch operations, 


status 
results. 
low active. 


INC 
Increment address from Y bus and load into MPC 
Pass address from Y bus to MPC 


unincremented. 


INTt 
Selects MPC as input to stack 
Selects 
interrupt 
return 
register 
as input 
to 


stack 


OSEl 
Selects stack as output from ORA output MUX 
Selects RCA as output from ORA output MUX 


MUX2-MUXO 
See Table 4 
See Table 4 


RAOE 
ORA output disabled (high-Z) 
ORA output enabled 


RBOE 
ORBoutput disabled (high-Z) 
ORBoutput enabled 


RC2-RCO 
See Table 6 
See Table 6 


RE 
Hold 
interrupt 
return 
register 
contents 
load address on Y bus to interrupt return 


register 


S2-S0 
See Table 5 
See Table 5 


SElOR 
Selects ORA/ORBexternal data as inputs to ORA/ORBbuses 
Selects RCA (OSEl low) or stack (OSEl high) 


to ORA bus, RCBto ORB bus 


YOE 
Y output disabled (high-Zl 
Y output enabled 


ZEROIN 
Sets ZEROUTto a high externally to set up conditional branch 
No effect 


tNo control effect when ORA' or ORB' selected (MUX2-MUXO) 
HlH) because B3-BO are address inputs. 
tWhen Bl is low or Bl is not in control mode. 


Y output multiplexer 


Address selection is controlled 
by the Y output 
multiplexer 
and the RAOE and RBOE enables, Addresses 


can be selected from eight sources: 


1. 
The microprogram 
counter register, used for repeat ONC off) and continue 
ONC on) instructions 


2. 
The stack, which supports subroutine calls and returns as well as iterative loops and returns from 
interrupts 


3. 
The ORA and ORB ports, which provide two additional paths from external hardware 
by which 


microprogram 
addresses can be generated 


4. 
Register counters 
RCA and RCB, which 
can be used for additional 
address storage 


5. 
B3-BO, whose contents 
can replace the four least significant 
bits of the ORA and ORB buses 


to support 
16-way 
and 32-way 
branches 


6. 
An external 
input onto the bidirectional 
Y port to support 
external 
interrupts. 


Use of controls MUX2-MUXO is explained further in the later section on microprogramming 
the' ACT8818A. 
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microprogram 
counter 


Based on system status and the current instruction, the microsequencer outputs the next execution address 
in the microprogram. Usually the incrementer adds one to the address on the Y bus to compute next address 
plus one. Next address plus one is stored in the microprogram 
register at the beginning of the subsequent 
instruction 
cycle. During the next instruction, 
this 'continue' 
address will be ready at the Y output 
MUX 
for possible selection as the source of the subsequent instruction. The incrementer thus looks two addresses 
ahead of the address in the instruction 
register to set up a continue 
(increment 
by one) or repeat (no 
increment) 
address. 


Selecting INC from status is a convenient 
means of implementing 
instructions 
that must repeat until SOine 
condition 
is satisfied; 
for example, Shift ALU Until MSB = 1, or Decrement ALU Until Zero. The MPC is 
also the standard path to the stack. The next address is pushed onto the stack during a subroutine 
call, 


so that the subroutine 
will return to the instruction 
following 
that from which 
it was called. 


register/counters 


Addresses or loop counts may be loaded directly into register/counters 
RCA and RCB through the direct 


data ports DRA 15-DRAO and DRB15-DRBO. The values stored in these registers 
may either be held, 
decremented, 
or read. Independent control of both the registers during a single cycle is supported 
with 


the exception 
of a simultaneous 
decrement 
of both registers. 


The positive-edge-clocked 
16-bit address stack allows multiple levels of nested calls or interrupts 
and can 
be used to support 
branching and looping. Seven stack operations 
are possible: 


1. 
Reset, which pulls all Y outputs 
low and clears the stack pointer and read pointer 


2. 
Clear, which sets the stack pointer and read pointer to zero 


3. 
Pop, which causes the stack pointer to be decremented 


4. 
Push, which puts the contents 
of the MPC, interrupt 
return register, or DRA bus onto the stack 
and increments 
the stack pointer 


5. 
Read, which 
makes the address indicated 
by the read pointer available at the DRA port 


6. 
Hold, which causes the address of the stack and read pointers to remain unchanged 


7. 
Load stack pointer, 
which 
inputs the seven least significant 
bits of DRA to the stack pointer. 


stack 
pointer 


The stack 
pointer 
(SP) operates 
as an up/down 
counter; 
it increments 
whenever 
a push occurs 
and 
decrements whenever a pop occurs. Although push and pop are two event operations (store then increment 
SP, or decrement 
SP then read), the' ACT8818A 
performs 
both events within 
a single cycle. 


read 
pointer 


The read pointer (RP) is provided as a tool for debugging microcoded systems. It permits a nondestructive, 
sequential read of the stack contents from the DRA port. This capability provides the user with a method 
of backtracking through the address sequence to determine the cause of overflow without affecting program 
flow, 
the status of the stack pointer, 
or the internal data of the stack. 
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stack warning/read error pin 


A high signal on the STKWRN/RER pin indicates 
a potential 
stack 
overflow 
or underflow 
condition. 


STKWRN/RER becomes active under two conditions. 
If 62 of the 65 stack locations 
(0-64) are full (the 


stack pointer is at 62) and a push occurs, the STKWRN/RER pin outputs 
a high signal to warn that the 


stack is approaching 
its capacity 
and will be full after two 
more pushes. 


The STKWRN/RER signal will remain high if hold, push, or pop instructions 
occur, until the stack pointer 


is decremented 
to 62. If a push instruction 
is attempted 
when the stack is full, the new address will be 


ignored and the old address in stack location 
64 will be retained. 


The STKWRN/RER pin will go high when the stack pointer is less than or equal to one and a pop or read 
from stack is coded on the S2-S0 pins. The pin will go high after reading the next to the bottom 
stack 
address (1). When the S2-S0 pins are set to pop or read the last address (0) or to pop or read an empty 
stack, the STKWRN/RER pin will go high. The pin depends only on the setting of the S2-S0 pins and the 
stack pointer, 
not on the clock. 


interrupt return register 


Unlike the MPC register, which normally gets next address plus one, the interrupt 
return register simply 


gets next address. This permits interrupts to be serviced with zero latency, since the interrupt vector replaces 
the pending address. 


The interrupting 
hardware disables the Y output 
and forces the vector onto the microaddress 
bus. This 


event must be synchronized with the system clock. The first address of the service routine must program 
INT low and perform a push to put the contents 
of the interrupt 
return register on the stack. 


absolute maximum ratings over operating free air temperature range (unless otherwise notedlt 


Supply voltage, 
VCC (see Note 1) 
- 0.5 V to 6 V 


Input clamp current, 
11K(VI < 0 or VI> VCC) 
± 20 mA 


Output 
clamp current, 
10K (VO<O 
or VO>VCC 
±50 
mA 


Continuous 
output 
current, 
10 (VO = 0 to VCC) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
± 50 mA 


Continuous 
current through 
VCC or GND pins. 
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
± 100 mA 


Operating free-air temperature 
range. 
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
O°C to 70°C 


Storage temperature 
range . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
65°C 
to 150°C 


tStresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" 
is not implied. Exposure to absolute maximum rated conditions for extended periods may affect device reliability. 


NOTE 1: All voltage levels are with respect to GND. 


MIN 
NOM 
MAX 
UNIT 


VCC 
Supply voltage 
4.75 
5 
5.25 
V 


VIH 
High-level input voltage 
2 
VrC 
V 


VIL 
Low-level input voltage 
0 
0.8 
V 


IOH 
High-level output current 
-8 
mA 


IOL 
Low-level output current 
8 
mA 


VI 
Input voltage 
0 
VCC 
V 


Vo 
Output voltage 
0 
VCC 
V 


dt/dv 
Input transition rise or fall rate 
0 
15 
nslV 


TA 
Operating free~airtemperature 
0 
70 
°c 


TEXAS ~ 
INSTRUMENTS 


electrical characteristics over recommended operating free-air temperature range (un)ess·O'iFierwrse 
noted) 


PARAMETER 


. 
TEST CONDITIONS 
vcc 
MIN 
Typt 
MAX 
UNIT 


4.75 V 
4.6 
10H = 
-20 
p.A 


5.25 V 
5.1 
VOH 
V 


4.75 V 
3.85 
3.95 
10H = 
-8 
mA 
5.25 V 
4.6 
4.7 


4.75 V 
0.1 
JOl = 20 p.A 
5.25 V 
0.1 
VOL 
V 


4.75 V 
0.32 
0.45 
10l = 8 mA 
5.25 V 
0.32 
0.45 


lit 
VI = VCC or 0 
5.25 V 
±0.1 
±5 
p.A 


ICCQ 
VI = VCC or 0 
5.25 V 
100 
200 
p.A 


CI 
VI = VCC or 0 
5V 
10 
pF 


~ICC§ 
One input at 3.4 V, other inputs at 0 or VCC 
5.25 V 
1 
mA 


t All typical values are at VCC = 5 V, TA = 25°C. 
tFor I/O ports, the parameter 10ZH and 10Zl include the off state output current. 
§This is the increase in supply current for each input that is at one of the specified TTl voltage levels rather than 0 V or VCC. 


maximum 
switching 
characteristics 
over recommended 
operating supply voltage 
and free-air 


temperature range (unless otherwise noted)' 


TO 
FROM 
PARAMETER 
(OUTPUT) 
UNIT 
(INPUT 
Y 
ZEROUT 
ORA 
STKWRN 
ORB 
COUT 


CC 
17 
ns 
22 
18 
12 
19 
ClK 
ns 
24# 
17# 


DRA15-DRAO 
17 
DRB15-DRBO 
17 


MUX2-MUXO 
17 


tpd 
RC2-RCO 
19 
14 
52-SO 
18 
14 
B3-BO 
14 
ns 
05El 
18 
15 
ZEROIN 
18 


5ElDR 
16 


INC 
15 
Y 
13 
YOE 
16 


ten 
RAOE 
18 
ns 
RBOE 
17 


YOE 
12 


tdis 
RAOE 
12 
ns 
RBOE 
12 


1See Parameter Measurement 
Information for load circuit and voltage waveforms. 
#Decrementing 
register/counter A or 8 and sensing a zero, 
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timing requirements over recommended 
ranges of supply voltage and operating free-air temperature 
(unless otherwise 
noted) 


setup and hold times 


PARAMETER 
MIN 
MAX 
UNIT 


Setup time, from CC to CLK, destination is stack 
11 


Setup time, from ORA15-ORAO to CLK1, destination is stack 
9 


Setup time, from ORA15-ORAO to CLK1, destination is RCA 
6 


Setup time, from ORA15-ORAOto CLK1, destination is INT RT 
11 


Setup time, from ORS15-ORSOto CLKi, destination is RCS 
7 


Setup time, from ORS15-ORSOto CLKi, destination is INT RT 
11 


Setup time, from INC to CLKi, destination is MPC 
11 


Setup time, from INT to CLKi, destination is stack 
7 


Setup time, from RC2-RCOto CLKi, destination is stack 
7 


Setup time, from RC2-RCOto CLKi, destination is RCA or RCS 
14 


Setup time, from RC2-RCOto CLKi, destination is INT RT 
6 


Setup time, from S2-S0 to CLKi, destination is stack 
11 
tsu 
ns 
Setup time, from S2-S0 to CLKi, destination is INT RT 
11 


Setup time, from OSEL to CLKi, destination is stack 
10 


Setup time, from OSEL to CLKi, destination is INT RT 
10 


Setup time, from S3-S0 to CLKi, destination is stack 
8 


Setup time, from S3-S0 to CLKi, destination is INT RT 
11 


Setup time, from SELORto CLKi, destination is stack 
8 


Setup time, from SELORto CLKi, destination is INT RT 
8 


Setup time, from ZEROINto CLKi, destination is stack 
12 


Setup time, from ZEROINto CLKi, destination is INT RT 
10 


Setup time, from Y to CLKi, destination is MPC 
6 


Setup time, from REto CLKi, destination is INT RT 
6 


Setup time, from MUX2-MUXO to CLKi, destination is INT RT 
9 


th 
Hold time, any input after CLKi 
0 
ns 


PARAMETER 
MIN 
MAX 
UNIT 


tw1 
Pulse duration, clock low 
8 
ns 


tw2 
Pulse duration, clock high 
8 
ns 


tc 
Clock cycle time 
25 
ns 
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TIMING 
CLOADt 
IOL 
IOH 
VLOAD 
PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


tpZH 
0 
ten 
tpZL 
50 
8 
-8 
3 


tdls 
tpHZ 
-8 
tpLZ 
50 
8 
1.5 


tl'ld 
50 
8 
-8 
:I: 


t CLOAD includes probes and test fixture capacitance. 


:\:VLOAD - VOL 
= 50 Q, where VOL = 0.6 V, IOL = 8 mA. 


IOL 


Timing 
-I 
\:""- 


(See N~~:~; -----4~_ 
~5 ~ 
__ 
\... 


tsu~th 


Data 
~.7VI 
2.7V 
--- 
Input 
1.5 V 
_J 
1.5 V 
0.3V 
0.3 VI 
tr ~ 
i+--- 
41 I+-- tf 


VOLTAGE WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


~--------- 
3V 
Input 
L 1.5V 
'\. 
1.5V 


(See Note A) --Ii 
I ---- 
0 V 


tpd -l4------+i 
~ 
tpd 
I J 
! 
1-- 
VOH 
In· Phase 
1 
1.5 vi 
\L 
~.5 V 
Output 
. 
I 
'\..::.. 
V 
I 
OL 
I~ 
., 
I4--¥- 
tpd 


tpd~ 
1 
\L 
!I:':': VOH 
\. 
1.5V 
T 1.5V 


-. ---- 
.•- 
- 
- 
VOL 


Out·of·Phase 
Output 


~ 


--3V 


1.5 V 
1.5 V 


I 
! 
OV 
I+-tw-.l 
I 
i 


~ 


I 
3V 


1.5V 
1.5V 
____ 
OV 


VOLTAGE WAVEFORMS 
PULSE DURATION 


Output 
Control 
(Iow·level 
enabling) 


Waveform 
1 
(See Note B) 


1.5V 


~---- 
OV 
tPZL-.J 
~ 
1 
I 
I 
tPLZ-+j 
I•... 


1 \1 
__ 
L1 
3V 
1 
1I,r-- 
-1.5V 


11.5 
V 
l"'f 
VOL + 0.3 V 
_.--_._- 
-- 
- 
- 
VOL 
1 
tPHZ-.I 
I+- 


tpZH -+I 
I.... 
I 
I ------ 
- 
- 
- 
VOH 
1 
~v 
____ 
~~.~------ 
~~5V 
Waveform 
2 
(See Note C) 


Notes: 
A. Phase relationships between waveforms were chosen arbitrarily. All input pulses are supplied by pulse generators having the following 
characteristics: 
PRR = 1 MHz, Zo = 50 Q, tr •• 6 ns, tf" 
6 ns. 
B. Waveform 
1 is for an output with internal conditions such that the output is low except when disabled by the output control. 


C. Waveform 2 is for an output with internal conditions such that the output is high except when disabled by the output control. For tpLZ 
and tPHZ. VOL and VOH are specified values. 


TEXAS ~ 
INSTRUMENlS 


SN74ACT8818A 
16·BIT MICROSEOUENCER 


microprogramming the 'ACT8818A 


Microprogramming 
is unlike programming 
monolithic 
processors 
for several reasons. 
First, the width 
of 
the microinstuction 
word is only partially constrained 
by the basic signals required to control the sequencer. 


Since the main advantage of a microprogrammed 
processor is speed, many operations 
are often supported 
by or carried out in special purpose hardware. 
Lookup tables, extra registers, 
address generators, 
elastic 
memories, 
and data acquisition 
circuits 
may also be controlled 
by the microinstruction. 


The number of slices in a bit-slice ALU is user-defined, 
which makes the microinstruction 
width 
even more 
application 
dependent. 
Types of instructions 
resulting 
from manipulation 
of the sequencer 
controls 
are 
discussed 
below. 
Examples 
of some commonly 
used instructions 
can be found 
in the later section 
of 
microinstructions 
and flow 
diagrams. 
The following 
abbreviations 
are used in the tables in this section: 


BR A 
BR A' 
BR B 
BR B' 
BR S 
CALL A 
CALL B 
CALL A' 
CALL B' 
CALL S 
CLR SP, RP 
CONT/RPT 
ORA 
ORA' 
ORB 
ORB' 
MPC 
POP 
PUSH 
RCA 
RCB 
REAO 
RESET 
RP 
SP 
STK 


Y - 
ORA 
Y - 
ORA' 


Y - 
ORB 
Y - 
ORB' 
Y 
STK 
Y - 
ORA; STK - 
MPC; SP - 
SP + 1; RP - 
RP + 1 
Y - 
ORB; STK - 
MPC; SP - 
SP + 1; RP - 
RP + 1 
Y - 
ORA'; 
STK - 
MPC; SP - 
SP + 1; RP - 
RP + 1 
Y - 
ORB'; STK - 
MPC; SP - 
SP + 1; RP - 
RP + 1 
Y - 
STK; STK - 
MPC; SP - 
SP + 
1; RP - 
RP + 
1 
SP - 
0; RP - 
points to TOS register 
Y - 
MPC + 1 if INC = H; Y - 
MPC if INC = L 
Bidirectional 
data port (can be loaded externally 
or from 
RCA) 
ORA 15-0RA4::B3-BO 
Bidirectional 
data port (can be loaded externally 
or from 
RCB) 
ORB15-0RB4::B3-BO 
Microprogram 
counter 
SP - 
SP - 
1; RP - 
RP - 
1 
STK - 
operand; 
SP - 
SP + 1; RP - 
RP + 
1 
Register/counter 
A 
Register/counter 
B 
ORA - 
STK; RP - 
RP - 
1; SP - 
SP - 
1 
Y - 
0; SP - 
0; RP - 
points to TOS register 
Read pointer 
Stack pointer 
Stack 


address selection 


V-output 
multiplexer 
controls 
MUX2-MUXO 
select one of eight 3-source 
branches 
as shown 
in Table 2. 
The states of CC and ZERO determine 
which 
of the three sources is selected 
as the next address. ZERO 


is set at the beginning of any cycle in which a register/counter 
will decrement 
to zero. This applies to both 
internal 
ZERO and external 
ZEROUT signals. 
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V OUTPUT SOURCE 
MUX2- 
RESET 
CC - 
L 
CC - 
H 
MUXO 
ZERO - 
L 
ZERO - 
H 


XXX 
Ves 
All Low 
All Low 
All Low 


LLL 
No 
STK 
MPC 
ORA 


LLH 
No 
STK 
MPC 
ORB 


LHL 
No 
STK 
ORA 
MPC 


LHH 
No 
STK 
ORB 
MPC 


HLL 
No 
ORA 
MPC 
ORB 


HLH 
No 
ORA't 
MPC 
ORB't 


HHL 
No 
ORA 
STK 
MPC 


HHH 
No 
ORB 
STK 
MPC 


tORA 15-0RA4::B3-BO 
t ORB15-0RB4:: B3-BO 


By programming 
CC high or low without 
decrementing 
registers, 
only one outcome 
is possible; thus, 
unconditional 
branches or continues can be implemented 
by forcing the condition code. Alternatively, 
CC 
can be selected from status, 
in which 
case Branch A on Condition 
Code Else Branch B instructions 
are 
possible, where A and B are the address sources determined 
by MUX2-MUXO. 


Decrement and Branch on Nonzero instructions, 
creating loops that repeat until a terminal count is reached, 
can be implemented 
by programming 
CC low and decrementing 
a register/counter. 
If CC is selected from 
status and registers are decremented, 
more complex instructions 
such as Exit on Condition 
Code or End 
or Loop are possible. 


When MUX2-MUXO 
= HLH, the B3-BO inputs can replace the four least significant 
bits of DRA or DRB 
to create 
16-way 
branches or, when CC is based on status, 
to create 32-way 
branches. 


stack controls 


As in the case of the MUX controls, 
each stack-control 
coding is a three-way 
choice based on CC and 
ZERO (see Table 3). This allows 
push, 
pop, 
or hold stack 
operations 
to occur 
in parallel 
with 
the 
aforementioned 
branches. A subroutine call is accomplished by combining a branch and push, while returns 
result from coding a branch to stack with 
a pop. 


STACK OPERATION 


S2-S0 
OSEL 
CC - 
L 
CC - 
H 
ZERO - 
L 
ZERO - 
H 


LLL 
X 
Reset/Clear 
Reset/Clear 
Reset/Clear 


LLH 
X 
Clear SP/RP 
Hold 
Hold 


LHL 
X 
Hold 
Pop 
Pop 


LHH 
X 
Pop 
Hold 
Hold 


HLL 
X 
Hold 
Push 
Push 


HLH 
X 
Push 
Hold 
Hold 


HHL 
X 
Push 
Hold 
Push 


HHH 
H 
Read 
Read 
Read 


HHH 
L 
Hold 
Hold 
Hold 
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Combining 
stack and MUX controls 
with 
status 
results 
and register decrements 
permits 
even greater 


complexity. 
For example: 
Return on Condition Code or End of Loop; Call A on Condition Code Else Branch 


to B; Decrement 
and Return on Nonzero; Call 16-Way. 


Diagnostic 
stack dumps are possible using Read (S2-SO = HHH) when OSEL is set high. 


register controls 


Unlike stack and MUX controls, 
register control 
is not dependent 
upon CC and ZERO. Registers can be 


independently 
loaded, decremented, 
or held using register 
control 
inputs 
RC2-RCo (see Table 4). All 


combinations 
are supported with the exception of simultaneous 
register decrements. 
The register control 


inputs can be set to store branch addresses and loop counts or to decrement 
loop counts, facilitating 
the 


complex 
branching 
instructions 
described above. 


REGISTER OPERATIONS 
RC2-RCO 
REG B 
REG A 


LLL 
Hold 
Hold 


LLH 
Decrement 
Hold 


LHL 
Load 
Hold 


LHH 
Decrement 
Load 


HLL 
Load 
Load 


HLH 
Hold 
Decrement 


HHL 
Hold 
Load 


HHH 
Load 
Decrement 


The contents 
of RCA are accessible to the DRA port when OSEL is low and the output 
bus is enabled 


by RAOE being low. 
Data from RCB is available when DRB is enabled by RBOE being low. 


continue/repeat 
instructions 


The most commonly 
used instruction 
is a continue, 
implemented 
by selecting MPC at the Y output 
MUX 


and setting 
INC high. If MPC is selected and INC is off, the current instruction 
will simply be repeated. 


A repeat instruction 
can be implemented 
in two 
ways. A programmed 
repeat lINC forced 
low) may be 


useful in generating wait states, for example, wait for interrupt. 
A conditional 
repeat lINC originates from 


status) may be useful in implementing 
Do While operations. 
Several bit patterns in the MUX control field 


of the microinstruction 
will place MPC on the microaddress 
bus. 


A branch or jump to a given microaddress 
can also be coded several ways. 
RCA, DRA, RCB, DRB, and 


STK are possible sources for branch addresses (see Table 2). Branches to register or stack are useful 
whenever 
the branch address could be stored to reduce overhead. 


The simplest 
branches are to DRA and DRB, since they require only one cycle and the branch address 


is supplied in the microinstruction. 
Use of registers or stack requires an initial load cycle (which may be 


combined with a preceding instruction), 
but may be more practical when an entry point is referenced over 


and over throughout the microprogram, for example, in error-handling routines. Branches to stack or register 
also enhance sequencing techniques in which a branch address is dynamically computed or multiple branches 
to a common entry point are used, but the entry point varies according to the system state. In this case, 
the state change might require reloading the stack or register. 
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In order to force a branch to ORA or ORB, CC must be programmed high or low. A branch to stack is only 
possible when CC is forced 
low (see Table 2). 


When CC is low, the ZERO flag is tested, and if a register decrements to zero, the branch will be transformed 
into a Decrement and Branch on Nonzero instruction. 
Therefore, registers should not be decremented during 
branch instructions 
using CC = 0 unless it is certain the register will not reach terminal count. Call (Branch 
and Push MPC) instructions and Return (Branch to Stack and Pop) instructions are discussed in later sections. 


Perhaps the most useful of all branches is the conditional 
branch. The' ACT8818A 
permits three modes 
of conditional 
branching: 
Branch on Condition 
Code; Branch 16-Way from ORA or ORB; and Branch on 
Condition 
Code 16-Way from ORA Else Branch 16-Way from ORB. This increases the versatility 
of the 
system 
and the speed of processing status tests 
because both single-bit 
and 4-bit status 
are allowed. 


Testing single-bit status is preferred when the status can be set up and selected through 
a status 
MUX 
prior to the conditional 
branch. Four-bit status allows the' ACT8818A 
to process instructions 
based on 
Boolean status expressions, such as Branch if Overflow 
and Not Carry if Zero or if Negative. It also permits 
true n-way branches, such as If Negative then Branch to X, Else if Overflow, 
and Not Carry then Branch 
to Y. The tradeoff 
is speed versus program size. Since multiway 
branching occurs relatively 
infrequently 
in most programs, 
users will enjoy increased 
speed at a negligible 
cost. 
Call (Branch and Push MPC) 
instructions 
and Return (Branch to Stack and Pop) instructions 
are discussed 
in later sections. 


loop instructions 


Up to two 
levels of nested loops are possible when both counters 
are used simultaneously. 
Loop count 
and levels of nesting can be increased by adding external counters if desired. The simplest and most widely 
used of the loop instructions 
is Decrement 
and Branch on Nonzero, in which 
CC is forced 
low while a 
register is decremented. 
As before, manY,forms are possible, since the top-of-Ioop 
address can originate 
from RCA, ORA, RCB, ORB, or the stack (see Table 2). Upon terminal 
count, instruction 
flow 
can either 
drop out of the bottom 
of the loop or branch elsewhere. 


When loops are used in conjunction 
with CC as status, 
B3-BO as status and/or stack manipulation, 
many 
useful instructions 
are possible, including Decrement and Branch on Nonzero else Return, Decrement and 
Call on Nonzero, and Decrement 
and Branch 16-Way on Nonzero. Possible variations 
are summarized 
in 
Table 5. Call (Branch and Push MPC) instructions 
and Return (Branch to Stack and Pop) instructions 
are 
discussed 
in later sections. 


Another 
level of complexity 
is possible if CC is selected from status while looping. This type of loop will 
exit either because CC is true or because a terminal count has been reached. This makes it possible, for 
example, to search the ALU for a bit string. 
If the string is found, the match forces CC high. However, 


if no match is found, 
it is necessary to terminate 
the process when the entire word has been scanned. 
This complex 
process can then be implemented 
in a simple compact 
loop using Conditional 
Decrement 
and Branch on Nonzero. 
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MUX2- 
CC - 
L 
CC - 
H 
SE-SO 
OSEL 
MUXO 
ZERO - 
L 
ZERO - 
H 


LLL 
LLH 
X 
BR S: CLR SP/RP 
CONT/RPT 
BR A 


LLL 
LHL 
X 
BR S 
CONT/RPT: 
POP 
BR A: POP 


LLL 
HLL 
X 
BR S 
CONT/RPT: 
PUSH 
CALL A 


LLL 
HHH 
0 
BR S 
CONT/RPT 
BR A 


LLL 
HHH 
1 
BR S: READ 
CONT/RPT: 
READ 
BR A: READ 


LLH 
LLH 
X 
BR S: CLR SP/RP 
CONT/RPT 
BR B 


LLH 
LHL 
X 
BR S 
CONT/RPT: 
POP 
BR B: POP 


LLH 
HLL 
X 
BR S 
CONT/RPT: 
PUSH 
CALL B 


LLH 
HHH 
0 
BR S 
CONT/RPT 
BR B 


LLH 
HHH 
1 
BR S: READ 
CONT/RPT: 
READ 
BR B: READ 


LHL 
LLH 
X 
BR S: CLR SP/RP 
BR A 
CONT/RPT 


LHL 
LHL 
X 
BR S 
BR A: POP 
CONT/RPT: 
POP 


LHL 
HLL 
X 
BR S 
CALL A 
CONT/RPT: 
PUSH 


LHL 
HHH 
0 
BR S 
BR A 
CONT/RPT 


LHL 
HHH 
1 
BR S: READ 
BR A: READ 
CONT/RPT: 
READ 


LHH 
LLH 
X 
BR S: CLR SP/RP 
BR B 
CONT/RPT 


LHH 
LHL 
X 
BR S 
BR B: POP 
CONT/RPT: 
POP 


LHH 
HLL 
X 
BR S 
CALL B 
CONT/RPT: 
PUSH 


LHH 
HHH 
0 
BR S 
BR B 
CONT/RPT 


LHH 
HHH 
1 
BR S: READ 
BR B: READ 
CONT/RPT: 
READ 


HLL 
LLH 
X 
BR A: CLR SP/RP 
CONT/RPT 
BR B 


HLL 
LHL 
X 
BR A 
CONT/RPT: 
POP 
BR B: POP 


HLL 
LHH 
X 
BR A: POP 
CONT/RPT 
BR B 


HLL 
HLL 
X 
BR A 
CONT/RPT: 
PUSH 
CALL B 


HLL 
HHH 
0 
BR A 
CONT/RPT 
BR B 


HLL 
HHH 
1 
BR A: READ 
CONT/RPT: 
READ 
BR B: READ 


HLH 
LLH 
X 
BR A' (16-wayl: 
CLR SP/RP 
CONT/RPT 
BR B' 116-way) 


HLH 
LHL 
X 
BR A' (16-wayl 
CONT/RPT: 
POP 
BR B' (16-way): 
POP 


HLH 
LHH 
X 
BR A' (16-wayl: 
POP 
CONT/RPT 
BR B' (16-way) 


HLH 
HLL 
X 
BR A' (16-way) 
CONT/RPT: 
PUSH 
CALL B'(16-way) 


HLH 
HHH 
0 
BR A' (16-way) 
CONT/RPT 
BR B' (16-way) 


HLH 
HHH 
1 
BR A' 116-way): 
READ 
CONT/RPT: 
READ 
BR B' (16-way): 
READ 


HHL 
LLH 
X 
BR A: CLR SP/RP 
BR S 
CONT/RPT 


HHL 
LHL 
X 
BR A 
RET IBRS: POPI 
CONT/RPT: 
POP 


HHL 
LHH 
X 
BR A: POP 
BR S 
CONT/RPT 


HHL 
HLL 
X 
BR A 
CALL S 
CONT/RPT: 
PUSH 


HHL 
HHH 
0 
BR A 
BR S 
CONT/RPT 


HHL 
HHH 
1 
BR A: READ 
BR S: READ 
CONT/RPT: 
READ 


HHH 
LLH 
X 
BR B: CLR SP/RP 
BR S 
CONT/RPT 


HHH 
LHL 
X 
BR B 
RET 
CONT/RPT: 
POP 


HHH 
LHH 
X 
BR B: POP 
BR S 
CONT/RPT 


HHH 
HLL 
X 
BR B 
CALL S 
CONT/RPT: 
PUSH 


HHH 
HHH 
0 
BR B 
BR S 
CONT/RPT 


HHH 
HHH 
1 
BR B: READ 
BR S: READ 
CONT/RPT: 
READ 
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subroutine 
calls 


The various 
branch instructions 
described 
above can be merged with 
a push instruction 
to implement 
subroutine calls in a single cycle. Calls, conditional 
calls, and Decrement and Call on Nonzero are the most 
obvious. 


Since a push is conditional 
on CC and ZERO, many hybrid instructions 
are also possible, such as Call X 
on Condition 
Code Else Branch, or Decrement 
and Return on Nonzero Else Branch. Codes that 
cause 


subroutine 
calls are summarized 
in Tables 6 and 7. 


MUX2-MUXO 
52-50 
05EL 
CC - 
L (ZERO - 
L) 
CC - 
H 


LLL 
HLH 
X 
CALL S 
BR A 


LLL 
HHL 
X 
CALL S 
CALL A 


LLH 
HLH 
X 
CALL S 
BR B 


LLH 
HHL 
X 
CALL S 
CALL B 


LHL 
HLH 
X 
CALL S 
CONT/RPT 


LHL 
HHL 
X 
CALL S 
CONT/RPT: 
PUSH 


LHH 
HLH 
X 
CALL S 
CONT/RPT 


LHH 
HHL 
X 
CALL S 
CONT/RPT: 
PUSH 


HLL 
HLH 
X 
CALL A 
BR B 


HLL 
HHL 
X 
CALL A 
CALL B 


HLH 
HLH 
X 
CALL A' 116-way) 
BR B' (16-way) 


HLH 
HHL 
X 
CALL A' 116-way) 
CALL B' (16-way) 


HHL 
HLH 
X 
CALL A 
CONT/RPT 


HHL 
HHL 
X 
CALL A 
CONT/RPT: 
PUSH 


HHH 
HLH 
X 
CALL B 
CONT/RPT 


HHH 
HHL 
X 
CALL B 
CONT/RPT: 
PUSH 


MUX2- 
CC - 
L 
CC - 
H 
52-50 
05EL 
MUXO 
ZERO - 
L 
ZERO - 
H 


LLL 
HLH 
X 
CALL S 
CONT/RPT 
BR A 


LLL 
HHL 
X 
CALL S 
CONT/RPT 
CALL A 


LLH 
HLH 
X 
CALL S 
CONT/RPT 
BR B 


LLH 
HHL 
X 
CALL S 
CONT/RPT 
CALL B 


LHL 
HLH 
X 
CALL S 
BR A 
CONT/RPT 


LHL 
HHL 
X 
CALL S 
BR A 
CONT/RPT: 
PUSH 


LHH 
HLH 
X 
CALL S 
BR B 
CONT/RPT 


LHH 
HHL 
X 
CALL S 
BR B 
CONT/RPT: 
PUSH 


HLL 
HLH 
X 
CALL A 
CONT/RPT 
BR B 


HLL 
HHL 
X 
CALL A 
CONT/RPT 
CALL B 


HLH 
HLH 
X 
CALL A' 116-way) 
CONT/RPT 
BR B' 116-way) 


HLH 
HHL 
X 
CALL A' (16-way) 
CONT/RPT 
CALL B' (16-way) 


HHL 
HLH 
X 
CALL A 
BR S 
CONT/RPT 


HHL 
HHL 
X 
CALL A 
BR S 
CONT/RPT: 
PUSH 


HHH 
HLH 
X 
CALL B 
BR S 
CONT/RPT 


HHH 
HHL 
X 
CALL B 
BR S 
CONT/RPT: 
PUSH 
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subroutine 
returns 


A return from subroutine 
can be implemented 
by coding a branch to stack with a pop. Since pop is also 
conditional 
on CC and ZERO, the complex forms discussed previously 
also apply to return instructions: 
Decrement 
and Return on Nonzero; Return on Condition 
Code; Branch on Condition 
Code Else Return. 


Return encodings 
are summarized 
in Tables 8 and 9. 


MUX2-MUXO 
S2-S0 
OSEl 
CC - 
l 
CC - 
H 
III 
lHH 
X 
RET 
BR A 


llH 
lHH 
X 
RET 
BR B 


lHl 
lHH 
X 
RET 
CONT/RPT 


lHH 
lHH 
X 
RET 
CONT/RPT 


CC - 
l 
CC - 
H 
MUX2-MUXO 
S2-S0 
OSEl 
ZERO 
- 
l 
ZERO 
- 
H 
III 
lHH 
X 
RET 
CONT/RPT 
BR A 


llH 
lHH 
X 
RET 
CONT/RPT 
BR B 


lHl 
lHH 
X 
RET 
BR A 
CONT/RPT 


lHH 
lHH 
X 
RET 
BR B 
CONT/RPT 


HHl 
LHL 
X 
BR A 
RET 
CONT/RPT: 
POP 


HHH 
lHL 
X 
BR B 
RET 
CONT/RPT: 
POP 


Pulling the 52-SO pins low clears the stack and read pointers, 
and zeroes the Y output 
multiplexer 
(See 
Table 3). 


clear pointers 


The stack and read pointers can be cleared without 
affecting 
the Y output 
multiplexer 
by setting 
52-SO 
to LLH and forcing 
CC low (see Table 3). 


read stack 


Placing a high value on all of the stack inputs (52-SOl and 05EL places the' ACT8818A 
into the read mode. 
At each low-to-high 
clock transition, 
the address pointed to by the read pointer is available at the DRA 
port and the read pointer is decremented. 
The bottom 
of the stack is detected 
by monitoring 
the stack 
warning/read error pin (5TKWRN/RER). A high appears on the 5TKWRN/RER output when the stack contains 
one word and a read instruction 
is applied to the 52-SO pins. This signifies that the last address has been 
read. 


The stack pointer and stack contents 
are unaffected 
by the read operation. 
Under normal push and pop 
operations, 
the read pointer 
is updated with the stack pointer and contains 
identical 
information. 
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interrupts 


Real-time vectored 
interrupt 
routines are supported for those applications 
where polling would 
impede 
system throughput. Any instruction, including pushes and pops, may be interrupted. To process an interrupt, 
the following 
procedure should be followed: 


1. 
Place the bidirectional 
Y bus into a high-impedance 
state by forcing YOE high. 


2. 
Force the interrupt 
entry point vector onto the Y bus. INC should be high. 


3. 
Push the current value in the Interrupt Return register on the stack as the execution 
address to 
return to when interrupt 
handling is complete. 


The first instruction 
of the interrupt 
routine must push the address stored in the interrupt 
return register 
onto the stack so that proper return linkage is maintained. 
This is accomplished 
by setting 
INT and B1 
low and coding a push on the stack. 


sample microinstructions for the 'ACT8818A 


Representative 
examples of instructions 
using the' ACT8818A 
are given below. The examples assume 
a one-level pipeline system, in which the address and contents of the next instruction 
are being fetched 
while the current instruction 
is being executed, and an ALU status register contains the status results of 
the previous instruction. 


Since the incrementer looks two addresses ahead of the address in the instruction 
register to set up some 
instructions 
such as continue or repeat, a set-up instruction 
has been included with each example. This 
shows the required state of both INC and CC. CC must be set up early because the status register on 
which V-output 
selection is typically 
based contains the results of the previous instruction. 


Flow diagrams and suggested code for the sample microinstructions 
are also given below. Numbers inside 
the circles are microword address locations expressed as hexadecimal numbers. Fields in microinstructions 
are binary numbers except for inputs on DRA or DRB, which are also in hexadecimal. 
For a discussion 
of sequencing instructions, 
see the preceding section on microprogramming. 


continue 


To continue 
(Instruction 
10). INC and CC must be programmed high one cycle ahead of instruction 
10 
for pipe lining. 


MUX2-MUXO 


xxx 


110 


52-SO 


xxx 


111 


R2-RO 


XXX 


XXX 


OSEL 


X 
o 


ORA 


xxxx 
xxxx 


(Set-up) 


10 


xxxx 
xxxx 


continue 
and pop 


To continue and decrement the stack pointer (Pop), INC and CC are forced high in the previous instruction. 


MUX2-MUXO 


xxx 


110 


52-SO 


xxx 


010 


R2-RO 


xxx 
xxx 


OSEL 


X 


X 


ORA 


xxxx 
xxxx 


ORB 


xxxx 
xxxx 


(Set-up) 


10 
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continue 
and push 


To continue 
and push the microprogram 
counter onto the stack (Push), INC and CC are forced high one 


cycle ahead of Instruction 
10 for pipelining. 


(Set-up) 


10 
Continue/Push 


MUX2-MUXO 


XXX 


110 


52-SO 


XXX 


100 


R2-RO 


XXX 


XXX 


OSEL 


X 
o 


ORA 


XXXX 


XXXX 


XXX X 


XXX X 
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branch 
(example 
1) 


To branch from address 10 to address 20, CC must be programmed high one cycle ahead of Instruction 
10 
for pipelining. 


MUX2-MUXO 


xxx 


000 


(Set-up) 


10 


52-SO 


xxx 


111 


R2-RO 


XXX 


XXX 


OSEL 


X 
o 


DRA 


xxxx 


0020 


DRB 


xxxx 
xxxx 


branch 
(example 
2) 


To branch from address 10 to address 20, CC is programmed 
low in the previous instruction; 
as a result, 


a ZERO test follows 
the condition 
code test in Instruction 
10. To ensure that a ZERO = H condition 
will 


not occur, 
registers should not be decremented 
during this instruction. 


MUX2-MUXO 


xxx 


110 


(Set-up) 


10 


52-SO 


xxx 


111 


R2-RO 


XXX 


000 


OSEL 


X 
o 


DRA 


XXX X 


0020 


DRB 


XXX X 


XXXX 


sixteen-way 
branch 


To branch 16-way, 
CC is programmed high in the previous instruction. 
The branch address is derived from 


the concatenation 
DRB15-DRB4::B3-BO. 


MUX2-MUXO 


xxx 


101 


(Set-up) 


10 


52-SO 


xxx 


111 


R2-RO 


XXX 


XXX 


OSEL 


X 
o 


DRA 


XXXX 


XXXX 


DRB 


XXXX 


0040 


To branch to address 20 Else Continue to address 11, INC is set high in the preceding instruction 
to set 


up the Continue. 


MUX2-MUXO 


xxx 


110 
(Set-up) 


10 


BR A else 


Continue 


52-SO 


xxx 


111 
xxx 


000 


OSEL 


X 
o 


DRA 


XXXX 


0020 


DRB 


XXX X 


XXXX 
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three-way 
branch 


To branch 3-way, 
this example uses an instruction 
from Table 5 with 
BR A in the ZERO = L column, 
CONT/RPT in the ZERO = H column and BR B in the CC = H column. To enable the ZERO = H path, 
register A must decrement to zero during this instruction 
(see Table 4 for possible register operations). 
INC is programmed 
high in Instruction 
10 to set up the Continue. 


Address 
Instruction 
MUX2-MUXO 
52-SO 
R2-RO 
OSEL 


(Set-up) 
XXX 
XXX 
xxx 
X 


10 
Continue and 


Load Reg A 
110 
111 
010 
0 


11 
Decrement Reg A; 


Branch 3-way 
100 
111 
001 
0 


t Selected from external status 
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thirty-two-way 
branch 


To branch 32-way, 
the four least significant 
bits of the DRA' and DRB' addresses must be input at the 


B3-BO port; these are concatenated 
with the 12 most significant 
bits of DRA and DRB to provide new 


addresses DRA' (DRA 15-DRA4::B3-BO) 
and DRB' (DRB15-DRB4::B3-BO). 


MUX2-MUXO 


xxx 


101 


52-SO 


xxx 


111 


R2-RO 


xxx 


000 


OSEL 


X 
o 


ORA 


xxxx 


0040 


ORB 


xxxx 


0030 


(Set-up) 


10 


repeat 


To repeat (Instruction 
10), INC must be programmed low and CC high one cycle ahead of Instruction 
10 


for pipelining. 


(Set-up) 
10 


MUX2-MUXO 


XXX 


110 


52-SO 


XXX 


111 


R2-RO 


XXX 
xxx 


OSEL 


X 
o 


ORA 


xxxx 
xxxx 


ORB 


xxxx 
xxxx 


repeat on stack 


To continue and push the microprogram 
counter onto the stack (Push). INC and CC must be forced high 


one cycle ahead for pipelining. 


To repeat (Instruction 
12). an BR S instruction 
with ZERO = L is used. To avoid a ZERO = H condition, 
registers are not decremented 
during this instruction 
(see Table 4 for possible register operations). 
CC 


and INC are programmed 
high in Instruction 
12 to set up the Continue in Instruction 
11. 


Address 
Instruction 
MUX2-MUXO 
S2-S0 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 


(Set-up) 
xxx 
xxx 
xxx 
X 
1 
1 
xxxx 
xxxx 


10 
Continue/Push 
110 
100 
xxx 
X 
1 
1 
xxxx 
xxx x 


11 
Continue 
110 
111 
xxx 
0 
0 
X 
xxx x 
xxxx 


12 
BR Stack 
010 
111 
000 
0 
1 
1 
xxx x 
xxx x 


repeat until CC 
H 


To continue and push the microprogram 
counter onto the stack (Push), INC and CC must be forced high 


one cycle ahead for pipelining. 


To repeat until CC = H (Instruction 12). use a BRS instruction with CC = Land CONT/RPT: POPinstruction 
with CC = H. To avoid a ZERO = H condition, 
registers are not decremented 
(See Table 4 for possible 


register 
operations). 
CC and INC are programmed 
high in Instruction 
12 to set up the Continue 
in 


Instruction 
11. A consequence 
of this is that the instruction 
following 
13 cannot be conditional. 


Address 
Instruction 
MUX2-MUXO 
52-SO 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 


(Set-up) 
xxx 
XXX 
xxx 
X 
XXXX 
XXXX 


10 
ContinuelPush 
110 
100 
xxx 
X 
XXXX 
XXX X 


11 
Continue 
110 
111 
XXX 
0 
XXXX 
XXXX 


12 
BR Stack else 


Continue 
010 
010 
000 
X 
XXX X 
XXXX 


t Selected 
from external 
status 
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loop until zero 


To continue 
and push the microprogram 
counter onto the stack (Push), INC and CC are forced 
high one 


cycle ahead for pipelining. Register A is loaded with the loop counter using aLoad A instruction from Table 4. 


To decrement the loop count, a decrement 
register A and hold register B instruction 
from Table 4 is used. 
To Repeat Else Continue 
and Pop (decrement 
the stack pointer), 
an instruction 
from Table 5 with 
BR S 


in the ZERO = L column and CONT/RPT: POP in the ZERO = H column is used. CC is programmed 
low 


in Instruction 
11 to force the ZERO test in Instruction 
12; it is programmed 
high in Instruction 
12 to set 
up the Continue 
in Instruction 
11. 


Address 
Instruction 
MUX2-MUXO 
S2-S0 
R2-RO 
OSEL 


(Set-up) 
XXX 
XXX 
XXX 
X 


10 
ContinuelPush 
110 
100 
XXX 
0 


11 
Continue/Load 


Reg A 
110 
111 
010 
0 


12 
Decrement Reg A; 


BR S else 


Continue: 
Pop 
000 
010 
001 


conditional 
loop until zero 


ORA 


XXXX 


XXXX 


ORB 


XXXX 


XXXX 


Two examples 
of a Conditional 
Loop on Stack with 
Exit are presented 
below. 
Both use the microcode 


shown 
below to branch to the stack on nonzero, continue 
and pop on zero, and branch to ORA with 
a 
pop if CC = H. In the first example, the value on the ORA bus is the same as the value in the microprogram 
counter, 
making the exit destinations 
on the CC and ZERO tests the same. In the second, the values are 


different, 
generating 
a two-way 
exit. 


To continue 
and push the microprogram 
counter onto the stack (Push), INC must be high. CC is forced 
high in the preceding 
instruction 
for pipelining. 


To continue 
(Instruction 
11l, INC must be high. CC must be programmed 
high in the previous instruction. 


INC is programmed 
high to set up the Continue 
in Instruction 
12. 


To decrement and branch else exit (Instruction 
12), an instruction 
from Table 5 with BR S in the ZERO = L 


column, 
CONT/RPT: POP in the ZERO = H column 
and BR A: POP in the CC = H column 
is used. 


Example 1: 


Address 
Instruction 
MUX2-MUXO 
S2-S0 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 


(Set-up) 
XXX 
XXX 
XXX 
X 
XXXX 
XXXX 
10 
Continue/Push 
110 
100 
010 
X 
XXXX 
XXXX 
Load Reg A 


11 
Continue 
110 
111 
XXX 
0 
XXXX 
XXXX 


12 
Decrement Reg A; 


BR S else 
Continue: 
Pop 


else BR A: Pop 
000 
010 
001 
X 
0013 
XXXX 
t Selected from external status 
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MUX2-MUXO 


XXX 
110 


S2-50 


XXX 
100 


R2-RO 


XXX 
010 


OSEL 


X 


X 


ORA 


XXX X 
XXXX 


ORB 


XXXX 
XXXX 
(Set-up) 
10 
Continue/Push 


Load Reg A 


Continue 
Decrement Reg A; 
BR S else 
Continue: Pop 
else BR A: Pop 


tSelected from external status 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8818A 
16·81T MICROSEOUENCER 


jump to subroutine 


To call a subroutine 
at address 30, this example uses the instruction 
from Table 6 with 
CALL A in the 
CC = H column. 
CC is programmed 
high in the previous instruction. 


(Set-up) 


10 


MUX2-MUXO 


XXX 


000 


52-SO 


XXX 


110 


R2-RO 


XXX 
xxx 


OSEL 


X 


X 


ORA 


XXXX 


0030 


ORB 


XXXX 


XXXX 


conditional 
jump to subroutine 


To conditionally 
call a subroutine at address 20, this example uses an instruction 
from Table 6 with CALL 
A in the CC = L column and CONT/RPT in the CC = H column. CC is generated by external status during 
the preceding instruction. 
INC is programmed 
high in the preceding instruction 
to set up the Continue. 
To avoid a ZERO = H condition, 
registers should not be decremented 
during Instruction 
10. 


Address 
Instruction 
MUX2-MUXO 
52-SO 
R2-RO 
OSEL 
CC' 
INC 
ORA 
ORB 


(Set-up) 
XXX 
XXX 
XXX 
X 
t 
XXXX 
XXXX 


10 
Call A else 


Continue 
110 
101 
000 
X 
X 
X 
0020 
XXXX 


t Selected from external status 


two-way 
jump to subroutine 


To perform a two-way 
call to subroutine at address 20 or address 30, this example uses an instruction 
from Table 6 with CALL A in the CC = L column and CALL B in the CC = H column. 
In this example, 
CC is generated by external status during the preceding (set-up) instruction. 
INC is programmed 
high in 
the preceding instruction 
to set up the Push. To avoid a ZERO = H condition, 
registers should not be 
decremented 
during Instruction 
10. 


Address 
Instruction 
MUX2-MUXO 
52-SO 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 


(Set-up) 
XXX 
XXX 
XXX 
X 
t 
XXXX 
XXXX 


23 
Call A else 


Call B 
100 
110 
000 
X 
X 
X 
0020 
0030 


t Selected 
from 
external 
status 


return from subroutine 


To return from a subroutine, this example uses an instruction from Table 8 with RET in the CC = L column. 
CC is programmed 
low in the previous instruction. 
To avoid a ZERO = H condition, 
registers are not 
decremented 
during Instruction 
23. 


(Set-up) 
23 


MUX2-MUXO 


XXX 


010 


52-SO 


XXX 


011 


R2-RO 


XXX 


000 


OSEL 


X 


X 


ORA 


XXXX 


XXXX 


ORB 


XXXX 


XXXX 
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To conditionally 
return from a subroutine, 
this example uses an instruction 
from Table 8 with RET in the 
CC = L column 
and CONT/RPT in the CC = H column. 
CC is selected 
from 
external 
status 
in the 
previous instruction. 
To avoid a ZERO = H condition, 
registers are not decremented during Instruction 
23. 


Address 
Instruction 
MUX2-MUXO 
S2-S0 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 


(Set-up) 
XXX 
XXX 
XXX 
X 
XXXX 
XXXX 


23 
Return else 


Continue 
010 
011 
000 
X 
X 
X 
XXX X 
XXX X 


t Selected from external status 


clear pointers 


To continue 
(Instruction 
10). INC must be high; CC must be programmed high in the previous instruction. 
To Clear the Stack and Read Pointers and Branch to address 20 (instruction 
11). CC is programmed 
low 
in instruction 
10 to set up the Branch. To avoid a ZERO 
H condition, 
registers are not decremented 
during Instruction 
11. 


Address 
Instruction 
MUX2-MUXO 
S2-S0 
R2-RO 
OSEL 
CC 
INC 
ORA 
ORB 
(Set-up) 
XXX 
XXX 
XXX 
X 
1 
1 
XXXX 
XXXX 


10 
Continue 
110 
111 
XXX 
0 
0 
X 
0020 
XXXX 


11 
BR A and Clear 


SP/RP 
110 
001 
000 
X 
X 
X 
XXXX 
XXXX 


reset 


To reset the' ACT8818A, 
pull the 52-SO pins low. This clears the stack and read pointers and places the 
Y bus into a low state. 


MUX2-MUXO 


XXX 


S2-S0 


000 


R2-RO 


XXX 


OSEL 


X 


ORA 


XXXX 


ORB 


XXXX 
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I 
Design Support 


• 
40-ns Cycle Time 


• 
Low-Power 
1-J.'m EPIC" CMOS 


• 
Three-Port I/O Architecture 


• 
64-Word 
by 36-Bit Register File 


• 
Simultaneous 
ALU and Register Operations 


• 
Configurable 
as Quad 8-Bit or Dual 16-Bit 


Single Instruction, 
Multiple 
Data Machine 


• 
Parity Generation/Checking 


description 


The SN74ACT8832A 
is a 32-bit registered ALU 
(Arithmetic/logic 
Unit) 
that 
can 
operate 
at 
25 MHz and 25 MIPS (million instructions 
per 
second). Most instructions 
can be performed in 
a single cycle. The' ACT8832A was designed for 
applications 
that 
require 
high-speed 
logical, 
arithmetic, 
and shift 
operations 
and bit/byte 
manipulations. 


The 'ACT8832A 
can act as host CPU or can 
accelerate 
a 
host 
microprocessor. 
In 
high- 
performance graphics systems, the' ACT8832A 
generates 
display-list 
memory 
addresses and 


.controls 
the 
display 
buffer. 
In I/O controller 
applications, 
the 
'ACT8832A 
performs 
high- 
speed comparisons 
to initialize 
and end data 
transfers. 


SN74ACT8832A 
32·81T REGISTERED ALU 


GO ... 
PACKAGE 


(TOP VIEW) 


A 
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• 
B 
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e 
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A three-operand, 
64-word 
by 36-bit register file allows the' ACT8832A 
to create an instruction 
and store 


the previous result in a single cycle. 


The' ACT8832A 
registered AlU 
holds a primary position in the Texas Instruments 
family of innovative 
32-bit 
lSI devices. Compatible with the SN74AS888 
architecture 
and instruction 
set, the' ACT8832A 


performs as a high-speed microprogrammable 
32-bit registered AlU that can also be configured to operate 
as two 
16-bit AlUs 
or four 8-bit AlUs 
in single-instruction, 
multiple-data 
(SIMD) mode. 


Besides introducing 
the 'ACT8832A, 
this data sheet discusses basic concepts 
of microprogrammed 
architecture and the support tools available for system development. Details of the' ACT8832A 
architecture 
and instruction set are presented. Pin descriptions and assignments for the' ACT8832A 
are also presented. 


PRODUCTION DATA documents 
contein 
informetion 


current as of publication 
date. Products 
conform to 


specifi.ltions 
plr 
the terms 
of Te•• s Instruments 
standard warranty_ Production processing does not 
nlcesSlrily 
include 
testing 
of III plrlmlter 
•. 
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PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 


Al 
Y7 
C2 
Y5 
E3 
YO 
J15 
OA28 
Pl 
OA5 
51 
OB10 


A2 
Y13 
C3 
OEYO 
E4 
Y4 
J16 
OA27 
P2 
OB8 
52 
OB15 


A3 
Y15 
C4 
Y9 
E14 
Y30 
J17 
OA29 
P3 
OB12 
53 
OA10 


A4 
BYOFl 
C5 
Yl1 
E15 
TPO 
Kl 
OB6 
P4 
OA9 
54 
OA13 


A5 
5103 
C6 
Y14 
E16 
12 
K2 
OB7 
P5 
OA15 
55 
PERRA 


A6 
5102 
C7 
OEYl 
E17 
13 
K3 
OAO 
P6 
A5 
56 
A3 


A7 
IE5101 
C8 
GNO 
Fl 
EBl 
K4 
GNO 
P7 
Al 
57 
WEO 


A8 
IE5100 
C9 
VCC 
F2 
Cn 
K14 
GNO 
P8 
VCC 
58 
WE3 


A9 
5100 
Cl0 
C 
F3 
CLK 
K15 
OA24 
P9 
GNO 
59 
RFCLK 


Al0 
N 
Cll 
PERRY 
F4 
CF2 
K16 
OA25 
Pl0 
C4 
510 
B4 


All 
OE5 
C12 
Y17 
F14 
OEY3 
K17 
OA26 
Pll 
PERRB 
511 
82 


A12 
55F 
C13 
Y22 
F15 
11 
L1 
PBO 
P12 
GNO 
512 
C3 


A13 
Y18 
C14 
OEY2 
F16 
14 
L2 
OA2 
P13 
OB22 
513 
CO 


A14 
Y20 
C15 
Y28 
F17 
16 
L3 
VCC 
P14 
OA16 
514 
OB17 


A15 
Y23 
C16 
PY3 
Gl 
OBO 
L4 
GNO 
P15 
OA18 
515 
OB20 


A16 
Y24 
C17 
BYOF3 
G2 
EA 
L14 
GNO 
P16 
OA22 
516 
OB23 


A17 
Y25 
01 
CFl 
G3 
EBO 
L15 
VCC 
P17 
OB27 
517 
OA21 


Bl 
Y6 
02 
Yl 
G4 
GNO 
L16 
OB30 
Rl 
PAO 
T1 
OB14 


B2 
BYOFO 
03 
Y3 
G14 
GNO 
L17 
PB3 
R2 
OBll 
T2 
OA8 


B3 
Yl0 
04 
PYO 
G15 
15 
Ml 
OAl 
R3 
PBl 
T3 
OA12 


B4 
Y12 
05 
Y8 
G16 
17 
M2 
OA4 
R4 
OAll 
T4 
OA14 


B5 
PYl 
06 
GNO 
G17 
PA3 
M3 
OA7 
R5 
PAl 
T5 
OEA 


B6 
IE5103 
07 
GNO 
Hl 
OB2 
M4 
GNO 
R6 
A4 
T6 
A2 


B7 
IE5102 
08 
GNO 
H2 
OBl 
M14 
PA2 
R7 
AO 
T7 
WEl 


B8 
5101 
09 
VCC 
H3 
VCC 
M15 
OB26 
R8 
WE2 
T8 
5ELRFl 


B9 
Z 
010 
GNO 
H4 
GNO 
M16 
OB28 
R9 
VCC 
T9 
5ELRFO 


Bl0 
OVR 
011 
GNO 
H14 
GNO 
M17 
OB31 
Rl0 
Bl 
Tl0 
B5 


Bll 
M5ERR 
012 
No Pin 
H15 
VCC 
Nl 
OA3 
Rll 
C2 
Tll 
B3 


B12 
Y16 
013 
BYOF2 
H16 
OA31 
N2 
OA6 
R12 
OEB 
T12 
BO 


B13 
Y19 
014 
Y27 
H17 
OA30 
N3 
OB9 
R13 
OB18 
T13 
C5 


B14 
Y21 
015 
Y31 
Jl 
OB3 
N4 
OB13 
R14 
OB21 
T14 
Cl 


B15 
PY2 
016 
TPl 
J2 
084 
N14 
OA19 
R15 
PB2 
T15 
OB16 


B16 
Y26 
017 
10 
J3 
OB5 
N15 
OA23 
R16 
OA20 
T16 
OB19 


B17 
Y29 
El 
5ELMQ 
J4 
VCC 
N16 
OB25 
R17 
OB24 
T17 
OA17 


Cl 
Y2 
E2 
CFO 
J14 
Vrr 
N17 
OB29 
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.•. 
32-BIT 
REGISTERED 
ALU 


WRITE EN /1 


CLK 
CLK 
INPUT 
SELECT 


CARRY 
IN 
A PORTIO 
READ 
• · 
1: 


DA31-DAO 
REGISTER 
ADDRESS 
• · 
FILE 
SELECT 
5 


RF 
B PORTr 
READ 
• · 
OPERAND 
ADDRESS 
• · 
SELECT 
ALU 
SELECT 
5 
•.. 
ALUIMQ 
I 
WRITE I0 
SPECIAL 
SHIFTER 
ADDRESS 
: · 
SHIFT 
SELECT 
5 · 
FUNCTION 


"- 
SIO EN 


CONFIGURATION 
DA I 
PORT 


MODE 


SELECT 


TEST PINS 
PARITY 
DBI 
1 
ALU SHIFTER IOUTPUT 
110 
PORT 


MQ REGISTER 
SELECT 


0 


pORi I 


INSTRUCTIONS 


DA BUS 


PARITY 
DB BUS 


STATUS 
Y BUS 
7 
MASTER/SLAVE 


"- 
COMPARATOR 
DAO-DA31 
•.. 
SIGN 
DBO-DB31 
CARRY-OUT 
•.... 
EN 
YO-Y31 
STATUS 
•.... 
ZERO 
STATUS 
OVERFLOW 


BYTE OVERFLOW 
, 
r 


0 


~~ 


0 
· · 
~~ 
· · 
· · 
· · 
· · 
· · 
31 
31 


~ 


0· · 
'" 
· · 
IINSTRUCTION ) 
~ · · 
31 


V 


Sil50-Si03 


SSF 


PERRA 


PERRB 


PERRY 


MSERR 


N 


C 


Z 


OVR 


BY03-BYOO 
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5103-5100 


55F 


WE3-WEO 


CS-CO 


BS-BO 


4 
RFCLK 


PB3-PBO 


PERRB 


<lEB 


32 
OB31-0BO 


EB1-EBO 


/8 
•• 
17-10 
I 
I 


(3 
4 
CF2-CFO 


/ 
{2 
4 
TP1-TPO 
{4 
4 
IE5103-IE5100 
1{18 
4 
GNO 
1 
I 
{10 
4 
vcc 


REGI5TER 
FILE 
64 x 36 
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PIN 
DESCRIPTION 
110 
NAME 
NO. 


AO 
R7 


Al 
P7 


A2 
T6 
I 
Register file A port read address select 
A3 
56 


A4 
R6 


A5 
P6 


80 
Tl2 


81 
Rl0 


82 
511 
I 
Register file 8 port read address select 
83 
Tll 


84 
510 


85 
TlO 


8YOFO 
82 


8YOFl 
A4 
0 
5tatus signals indicate overflow conditions in certain data bytes 
8YOF2 
013 


8YOF3 
C17 


C 
Cl0 
0 
5tatus signal representing carry out condition 


CO 
513 


Cl 
Tl4 


C2 
Rll 
I 
Register file write address select 
C3 
512 


C4 
Pl0 


C5 
Tl3 


CFO 
E2 


CFl 
01 
I 
Configuration mode select, single 32-bit, two 16-bit, or four S-bit ALUs 


CF2 
F4 


Cn 
F2 
I 
ALU carry input 


CLK 
F3 
I 
Clocks synchronous registers on positive edge 


DAO 
K3 


DAl 
Ml 


DA2 
L2 


DA3 
Nl 


DA4 
M2 


DA5 
Pl 


DA6 
N2 


DA7 
M3 


DAB 
T2 


DA9 
P4 
A port data bus. Outputs register data (OEA= 0) or inputs external data (OEA 
1). 
1/0 
= 
DA10 
53 


DAll 
R4 


DA12 
T3 


DA13 
54 


DA14 
T4 


DA15 
P5 


DA16 
P14 


DA17 
Tl7 


DA1S 
P15 


DA19 
N14 
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PIN 
DESCRIPTION 
I/O 
NAME 
NO. 


OA20 
R16 


OA21 
517 


OAn 
P16 


OA23 
N15 


OA24 
K15 


OA25 
K16 
A port data bus. Outputs register data (OEA = 01 or inputs external data IDEA 
1). 
I/O 
= 
OA26 
K17 


OA27 
J16 


OA28 
J15 


OA29 
J17 


OA30 
H17 


OA31 
H16 


OBO 
G1 


OB1 
H2 


OB2 
H1 


OB3 
J1 


OB4 
J2 


OB5 
J3 


OB6 
K1 


OB7 
K2 


OB8 
P2 


OB9 
N3 


OB10 
51 


OB11 
R2 


OB12 
P3 


OB13 
N4 


OB14 
T1 


OB15 
52 


OB16 
T15 
I/O 
B port data bus. Outputs regiser data (DEB = 01 or used to input external data (DEB = 1) 


OB17 
514 


OB18 
R13 


OB19 
T16 


OB20 
515 


OB21 
R14 


OB22 
P13 


OB23 
516 


OB24 
R17 


OB25 
N16 


OB26 
M15 


OB27 
P17 


OB28 
M16 


OB29 
N17 


OB30 
L16 


OB31 
M17 


EA 
G2 
I 
ALU input operand select. High state selects external OA bus and low state selects register file. 


EBO 
G3 
I 
ALU input operand select. 5elects between register file, external DB port and MQ register 
EB1 
F1 
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PIN 
I/O 
DESCRIPTION 


NAME 
NO. 


GND 
C8 


GND 
D6 


GND 
D7 


GND 
D8 


GND 
D10 


GND 
D11 


GND 
G4 


GND 
G14 


GND 
H4 
Ground pins. All ground pins must be used. 


GND 
H14 


GND 
K4 


GND 
K14 


GND 
L4 


GND 
L14 


GND 
M4 


GND 
P9 


GND 
P12 


10 
D17 


11 
F15 


12 
E16 


13 
E17 
I 
Instruction 
input 
14 
F16 


15 
G15 


16 
F17 


17 
G16 


IESIOO 
A8 


IESI01 
A7 


IESI02 
I 
Shift pin enables, increases system speed and reduces bus conflict, 
active low 
B7 


IESI03 
B6 


MSERR 
B11 
0 
Master Slave Error pin, indicates error between data at Y output MUX and external Y port 


N 
A10 
0 
Output status signal representing sign condition 


OEA 
T5 
I 
DA bus enable, active low 


DEB 
R12 
I 
DB bus enable, active low 


DES 
A11 
I 
Status enable, active low 


OEYO 
C3 


OEY1 
C7 


0EY2 
I 
Y bus output enable, active low 
C14 


OEY3 
F14 


OVR 
B10 
0 
Output status signal represents overflow condition 


PAO 
R1 


PA1 
R5 


PA2 
I/O 
Parity bits port for DA data 
M14 


PA3 
G17 


PBO 
L1 


PB1 
R3 


PB2 
I/O 
Parity bits port for DB data 
R15 


PB3 
L17 
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PIN 
DESCRIPTION 
110 
NAME 
NO. 


PERRA 
S5 
a 
OA data parity error, signals error if an even parity check fails for any byte 


PERRB 
Pll 
a 
OB data parity error, signals error if an even parity check fails for any byte 


PERRY 
Cll 
a 
Y data parity error, signals error if an even parity check fails for any byte 


PYO 
04 


PYl 
B5 
1/0 
Y port parity data, input and output 
PY2 
B15 


PY3 
C16 


RFCLK 
S9 
I 
Register File Clock, allows multiple writes to be performed in one master clock cycle 


SELMQ 
El 
I 
MQ register select, selects output of ALU shilter or MQ register to be placed on Y bus 


SELRFO 
T9 
I 
Register File select. Controls selection of the Register File(RF) inputs by the RF MUX 
SELRFl 
T8 


SiCo 
A9 


SiC 1 
B8 


SiC2 
1/0 
Bidirectional shift pin, active low 
A6 


SiC3 
A5 


SSF 
A12 
I 
Special Shift Function, implements conditional shift algorithms 


TPO 
E15 


TPl 
016 
Test pins, supports system testing. Tied high for normal operation. 


VCC 
C9 


VCC 
09 


VCC 
H3 


VCC 
H15 


VCC 
J4 
Supply voltage (5 V) 
VCC 
J14 


VCC 
L3 


VCC 
l15 


VCC 
P8 


VCC 
R9 


WEO 
S7 


WEl 
T7 
Write Enable. Oata is written 
into RF when write enables are low and a low to high Register File Clock 


WE2 
R8 
IRFCLK) transition occurs. Active low. 


WE3 
S8 


YO 
E3 


Yl 
02 


Y2 
Cl 


Y3 
03 


Y4 
E4 


Y5 
C2 


Y6 
Bl 


Y7 
Al 


Y8 
05 
1/0 
Y port data bus 


Y9 
C4 


Yl0 
B3 


Yll 
C5 


Y12 
B4 


Y13 
A2 


Y14 
C6 


Y15 
A3 
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PIN 
DESCRIPTION 
I/O 
NAME 
NO. 
Y16 
B12 


Y17 
C12 


Y18 
A13 
Y19 
B13 
Y20 
A14 
Y21 
B14 
Y22 
C13 


Y23 
A15 
I/O 
Y portdatabus 
Y24 
A16 
Y25 
A17 
Y26 
B16 
Y27 
D14 
Y28 
C15 


Y29 
B17 
Y30 
E14 
Y31 
D15 


Z 
B9 
0 
Output status signal represents zero condition 


understanding 
microprogrammed 
architecture 


Figure 1 shows a simple microprogrammed 
system. 
The three basic components 
are an arithmetic/logic 


unit, a microsequencer, 
and a memory. 
The program that resides in this memory is commonly 
called the 


microprogram, 
while the memory itself is referred to as a micromemory 
or control store. The ALU performs 


all the required operations on data brought in from the external environment 
(main memory or peripherals, 
for example). 
The sequencer 
is dedicated 
to generating 
the next micromemory 
address from which 
a 


microinstruction 
is to be fetched. 
The sequencer and the ALU operate in parallel so that data processing 


and next-address 
generation 
are carried out concurrently. 


The microprogram 
instruction, 
or microinstruction, 
consists 
of control 
information 
to the ALU and the 


sequencer. 
The microinstruction 
consists 
of a number of fields of code that directly 
access and control 


the ALU, registers, 
bus transceivers, 
multiplexers, 
and other system 
components. 
This high degree of 


programmability 
in a parallel architecture 
offers greater speed and flexibility 
than a typical microprocessor, 


although 
the microinstruction 
serves the same purpose as a microprocessor 
opcode: it specifies control 


information 
by which 
the user is able to implement 
desired data processing 
operations 
in a specified 


sequence. The microinstruction 
cycle is synchronized 
to a system clock by latching the instruction 
in the 


microinstruction, 
or pipeline, register once for each clock cycle. 
Status results are collected 
in a status 


register which 
the sequencer samples to produce conditional 
branches within 
the microprogram. 
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design support 


TI's' ACT8832A 
32-bit registered ALU is supported by a variety of tools developed to aid in design evaluation 


and verification. 
These tools will streamline all stages of the design process, from assessing the operation 


and performance of the' ACT8832A 
to evaluating a total system application. 
The tools include a functional 


model, behavioral model, and microcode 
development 
software 
and hardware. 
Section 9 of this manual 


provides 
specific 
information 
on the design tools supporting 
T1's '8800 
Family. 


systems expertise 


Texas Instruments Datapath VLSI Products Systems Engineering group is available to help designers analyze 
TI's high-performance 
VLSI products, 
such as the' ACT8832A 
32-bit 
registered ALU. The group works 
directly 
with designers to provide ready answers to device-related 
questions 
and also prepares a variety 
of applications 
documentation. 


The group may be reached in Dallas, at (214) 
997-3970. 
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registered ALU 


The' ACT8832A 
is a 32-bit registered ALU that can be configured to operate as four 8-bit ALUs, two 16-bit 
ALUs, or a single 32-bit ALU. The processor instruction 
set is 100 percent upwardly 
compatible 
with the 
'AS888 
and includes 13 arithmetic 
and logical functions 
with 8 conditional 
shifts, multiplication, 
division, 
normalization, 
add and subtract 
immediate, 
bit and byte operations, 
and data conversions 
such as BCD, 
excess-3, 
and sign magnitude. 
New instructions 
permit 
internal 
flip-flops 
controlling 
BCD and divide 
operations 
to be loaded or read. 


Additional 
functions 
added to the' ACT8832A 
include byte parity and master/slave 
operation. 
Parity is 
checked at the three data input ports and generated at the Y output 
port. The 64-word 
register file is 36 
bits wide to permit storage of the parity bits. Master/slave 
comparator 
circuitry 
is provided at the Y port. 


The DA and DB ports can simultaneously 
input data to the ALU and the 64-word 
by 36-bit 
register file. 
Data and parity from the register file can be output 
on the DA and DB ports. 
Results of ALU and shift 
operations 
are output at the bidirectional 
Y port. The Y port can also be used in an input mode to furnish 
external data to the register file or during master/slave operation as an input to the master/slave comparator. 


Three 6-bit address ports allow a two-operand 
fetch and an operand write to be performed 
at the register 
file simultaneously. 
An MQ shifter 
and MQ register can also be configured 
to function 
independently 
to 
implement double-precision 
8-bit, 16-bit, and 32-bit shift operations. An internal ALU bypass path increases 
the speeds of multiply, divide and normalize instructions. 
The path is also used by 'ACT8832A 
instructions 
that 
permit bits and bytes to be manipulated. 


architecture 


Control 
input signals are summarized 
in Table 1. Data flow 
and details of the functional 
elements 
are 
presented 
in the following 
paragraphs 
(see the functional 
block diagram). 


SIGNAL 
HIGH 
LOW 


CF2-CFO 
See Table 5 
See Table 5 


EA 
Selects external DA bus 
Selects register file 


EB1-EBO 
See Table 3 
See Table 3 


iESiO'3-IES100 
Normal operation 
Force corresponding SIO 


inputs to high impedance 


17·10 
See Table 9 
See Table 9 


MOSEL 
Selects MO register 
Selects ALU 


OEA 
Inhibits DA and PA output 
Enables DA and PA output 


OEB 
Inhibits DB and PB output 
Enables DB and PB output 


0EY3-0EY0 
Inhibits Y and PY outputs 
Enables Y and PY outputs 


SELRF1-SELRFO 
See Table 2 
See Table 2 


SSF 
Selects shifted ALU output 
Selects ALU (unshifted) output 


TP1-TPO 
See Table B 
See Table 8 


WE3-WEO 
Inhibits register file write 
Byte enables for register file 


write (0 = LSB) 
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As shown in Figure 2, data enters the' ACT8832A 
from three primary sources: the bidirectional 
Y port, 
which 
is used in an input mode to pass data to the register file; and the bidirectional 
DA and DB ports, 
used to input data to the register file or the Rand 
S buses serving the ALU. Three associated 
I/O ports 


(PY, PA, and PBI are provided for associated 
parity data input and output. 


Data is input to the ALU through 
two 
multiplexers: 
R MUX, which 
selects the R bus operand from the 


DA port or the register file addressed by A5-AO; and S MUX, which 
selects data from the DB port, the 


register file addressed by B5-BO, or the multiplier-quotient 
(MQl register. 


The result of the ALU operation 
is passed to the ALU shifter, 
where it is shifted 
or passed without 
shift 
to the Y bus for possible output 
from the' ACT8832A 
and to the feedback 
MUX for possible storage in 
the internal register file. The MQ shifter, 
which 
operates in parallel with the ALU shifter, 
can be loaded 


from the ALU or the MQ register. The MQ shift result is passed to the MQ register, where it can be routed 
through 
the S MUX to the ALU or to the Y MUX for output 
from the chip. 


An internal 
bypass path allows 
data from the S MUX to be loaded directly 
into the ALU shifter 
or the 
divide/BCD 
flip-flops. 
Data from the divide/BCD 
flip-flops 
can be output 
via the MQ register. 


Data can be output 
from the three bidirectional 
ports, Y, DA, and DB, and their associated 
parity ports, 
PY, PA, and PB. DA and DB can also be used to read ALU input data on the Rand 
S buses for debug 
or other special purposes. 


SElRF1· 
RF MUX 


SElRFO 
36 
36 


WE3·WEO 


A5-AO 
Cs-CO 
REGISTER 
FilE 
8S-BO 


64X36 
RFClK 


PA3-PAO 
P83·PBO 


32 


EA 
EB'·EBO 


DA31·DAO 


Cn 
DB31-DBO 
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three-port 
register 
file 


The register file is 36 bits wide, 
permitting 
storage of a 32-bit 
data word with 
its associated 
parity bits. 
The 64 registers are accessed by three address ports. C5-CO address the destination 
register during write 


operations; 
A5-AO and B5-BO address any two 
registers 
during read operations. 
The address buses are 


also used to furnish 
immediate 
data to the ALU: A3-AO to provide constant 
data for the add and subtract 


immediate 
instructions; 
C3-CO and A3-AO to provide 
masks for set, reset, and test bit operations. 


Data is written 
into the register file when the write enable is low and a low-to-high 
register file clock (RFCLK) 
transition 
occurs. 
The separate 
register 
file clock allows 
multiple 
writes 
to be performed 
in one master 


clock cycle, allowing 
processors in multi-processor 
environments,to 
update one another's 
internal register 
files during a single cycle. 


Four write 
enable inputs are provided to allow separate control 
of data inputs in a byte-oriented 
system. 
WE3 is the write 
enable for the most significant 
byte. 


Register file inputs are selected by the RF MUX under the control of two register file select signals, SELRF1 
and SELRFO, shown 
in Table 2 (see also Table 4). 


SELRFl 
SELRFO 
SOURCE 
a 
a 
External DA input 
a 
1 
External DB input 


1 
a 
V-output 
MUX 


1 
1 
External Y port 


Rand 
S multiplexers 


ALU inputs are selected by the Rand S multiplexers. 
Controls which affect operand selection for instructions 
other than those 
using constants 
or masks are shown 
in Table 3. 


R-BUS 
S-BUS 


OPERAND 
OPERAND 
RESULT 


SELECT 
SELECT 
-SOURCE 
OPERAND 
DESTINATION 


EA 
EB1-EBO 


a 
R bus 
-Register 
file addressed by A5-AO 


1 
R bus 
-DA 
port 


00 
S bus 
-Register 
file addressed by B5-BO 


10 
S bus 
-DB 
port 


Xl 
S bus 
-MO register 


data input and output 
ports 


The DA and DB ports can be used to load the Sand/or 
R multiplexers 
from an external 
source or to read 


S or R bus outputs 
from the register 
file. The Y port can be used to load the register 
file and to output 


the next address selected by the Y output multiplexer. Tables 9 and 10 describe the MUX and output controls 
which 
affect 
DA, DB, and Y. 
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REGISTER 
DA 
DB 


Y BUS 
REGISTER 


FILE 
Y MUS 
PORT 
PORT 


OUTPUT 
FILE 
RESULT 


WRITE 
SELECT 
OUTPUT 
OUTPUT 
- 
SOURCE 
ENABLE 
SELECT 
DESTINATION 


ENABLE 
OEY 
MOSEL 
ENABLE 
ENABLE 


WE 


RFSEL1-RFSELO 
OEA 
OEB 


1 
0 
0 
X 
X 
YIPY 
- ALU shifter/parity 
generate 


1 
0 
1 
X 
X 
YIPY 
- MQ register/parity 
generate 


0 
0 
0 
1 
0 
Y/PY, RF 
- ALU shifter/parity 
generate 


0 
0 
1 
1 
0 
YIPY, RF 
- MQ register/parity 
generate 


0 
1 
X 
1 
1 
RF 
- External Y/PY 


0 
X 
X 
0 
0 
1 
X 
RF 
- External DA/PA 


0 
X 
X 
0 
1 
X 
1 
RF 
- External DB/PB 


0 
DAIPA 
- R bus register file output 


1 
DA/PA 
Hi-Z 


0 
DB/PB 
- S bus register file output 


1 
DB/PB 
Hi-Z 


The ALU can perform seven arithmetic 
and six logical instructions 
on the two 
32-bit 
operands selected 


by the Rand S multiplexers. 
It also supports multiplication, 
division, normalization, 
bit and byte operations 


and data conversion, 
including excess-3 BCD arithmetic. 
The' ACT8832A 
instruction 
set is summarized 


in Table 9_ 


The' ACT8832A 
can be configured to operate as a single 32-bit ALU, two 16-bit ALUs, or four 8-bit ALUs 


(see Figures 3 and 4). It can also be configured 
to operate on a 32-bit 
word formed 
by adding leading 


zeros to the 12 least significant bits of R bus data. This is useful in certain IBM'" relative addressing schemes. 


Configuration 
modes are controlled 
by three CF inputs as shown in Table 5. These signals also select the 


data from which 
status signals other than byte overflow 
will be generated. 


CONTROL INPUTS 
DATA FROM WHICH STATUS OTHER 
MODE SELECTED 
CF2 
CF1 
CFO 
THAN BYOF WILL BE GENERATED 


0 
0 
0 
Four B-bit 
Byte 0 


0 
0 
1 
Four B-bit 
Byte 1 


0 
1 
0 
Four B-bit 
Byte 2 


0 
1 
1 
Four B-bit 
Byte 3 


1 
0 
0 
Two 16-bit 
Least significant 
16-bit word 


1 
0 
1 
Two 16-bit 
Most significant 
16-bit word 


1 
1 
0 
One 32-bit 
32-bit 
word 


1 
1 
1 
Masked 32-bit 
32-bit 
word 
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The ALU and MQ shifters 
are used in all of the shift, 
multiply, 
divide and normalize functions. 
They can 
be used independently 
for single-precision 
or concurrently 
for double-precision 
shifts. 
Shifts can be made 
conditional, 
using the Special Shift Function 
(SSF) pin. 


bidirectional 
serial I/O pins 


Four bidirectional 
SID pins are provided to supply an end fill bit for certain shift instructions. 
These pins 
may also be used to read bits that are shifted out of the ALU or MQ shifters 
during certain instructions. 
Use of the SiO pins as inputs or outputs 
is summarized 
in Table 11. 


The four pins allow separate control of end fill inputs in configurations 
other than 32-bit mode (see Table 6 
and the functional 
block diagram). 


SIGNAL 
CORRESPONDINGWORD. PARTIAL WORD OR BYTE 


32-BIT MODE 
16-BIT MODE 
8-81T MODE 
5103 
- 
- 
Byte 3 


5102 
- 
most significant 
word 
Byte 2 
Si01 
- 
- 
Byte 1 


5100 
32-bit word 
least significant 
word 
Byte 0 
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To increase system speed and reduce bus conflict, 
four SIO input enables (IESI03-IESIOO) are provided. 
A low on these enables will override internal 
pull-up resistor logic and force the corresponding 
SIO pins 


to the high-impedance 
state required before an input signal can appear on the signal line. If the SIO enables 


are not used, this condition 
is generated internally 
in the chip. Use of the enables allow internal decoding 


to be bypassed, 
resulting 
in faster 
speeds. 


The IESIOs are defaulted 
to a high because of internal 
pull-up resistors. 
When an SIO pin is used as an 


output, 
a low on its corresponding 
IESIO pin would force SIO to a high impedance state. The output would 


then be lost, but the internal 
operation 
of the chip would 
not be affected. 


MQ register 


Data from the MQ shifter 
is written 
into the MQ register when a low-to-high 
transition 
occurs 
on clock 


CLK. The register has specific functions in double precision shifts, multiplication, 
division and data conversion 


algorithms 
and can also be used as a temporary 
storage register. 
Data from the register file and the DA 


and DB buses can be passed to the MQ register through 
the ALU. 


The Y bus contains 
the output 
of the ALU shifter 
if SELMQ is low and the output 
of the MQ register 
if 


SELMQ is high. If OEY is low, ALU or MQ shifter 
output 
will be passed to the Y port; if OEY is high, the 


Y port becomes an input to the feedback 
MUX. 


conditional 
shift 
pin 


Conditional shifting algorithms 
may be implemented 
using the SSF pin under hardware or firmware 
control. 


If the SSF pin is high or floating, 
the shifted 
ALU output 
will be sent to the output 
buffers. 
If the SSF 


pin is pulled low externally, 
the ALU result will be passed directly 
to the output 
buffers, 
and MQ shifts 


will be inhibited. 
Conditional 
shifting 
is useful for scaling 
inputs 
in data arrays or in signal processing 


algorithms. 


master/slave 
comparator 


A master/slave 
comparator 
is provided to compare data bytes from the Y output 
MUX with 
data bytes 


on the external Y port when OEY is high. If the data are not equal, a high signal is generated on the master 
slave error output 
pin (MSERR). A similar comparator 
is provided 
for the Y parity 
bits. 


divide/BCD 
flip-flops 


Internal 
multiply/divide 
flip-flops 
are used by certain 
multiply 
and divide instructions 
to maintain 
status 


between 
instructions. 
Internal excess-3 
BCD flip-flops 
preserve the carry from each nibble in excess-3 


BCD operations. 
The BCD flip-flops 
are affected 
by all instructions 
except 
NOP and are cleared when a 


CLR instruction 
is executed. 
The flip-flops 
can be loaded and read externally 
using instructions 
LOADFF 


and DUMPFF (see Table 9). This feature 
permits 
an iterative 
arithmetic 
operation 
such as multiplication 


or division 
to be interrupted 
immediately 
so that an external 
interrupt 
can be processed. 


Eight status output signals are generated by the' ACT8832A. 
Four signals (BYOF3-BYOFO) indicate overflow 


conditions 
in certain data bytes (see Table 7). The others represent sign (N). zero (ZERO), carry-out 
(Cout) 


and overflow 
(OVR). N, ZERO, Cout, and OVR are generated from data selected by the mode configuration 


controls 
(CF2-CFO) as shown 
in Table 5. 


Carry-out is evaluated after each ALU operation. Sign and zero status are evaluated after ALU shift operation. 
Overflow 
(OVR) is determined 
by ORing the overflow 
result from the ALU with the overflow 
result from 


the ALU shifter. 
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SIGNAL 
CORRESPONDINGWORD, PARTIAL WORD OR BYTE 


32-BIT MODE 
16-BIT MODE 
B-BIT MODE 


BYOF3 
32-bit word 
most significant 
word 
Byte 3 


BYOF2 
- 
- 
Byte 2 


BYOF1 
- 
least significant 
word 
Byte 1 


BYOFO 
- 
- 
Byte 0 


input data parity check 


An even parity check is performed on each byte of input data at the DA, DB and Y ports. The check is 
performed 
by counting the number of ones in each byte and its corresponding 
parity bit. Parity bits are 
input on PA for DA data, PB for DB data and PYF or Y data. PAO, PBO and PYO are the parity bits for 
the least significant 
bytes of DA, DB and Y, respectively. 
If the result of the parity count is odd for any 
byte, a high appears at the parity error output pin (PERRAfor DA data, PERRBfor DB data, PERRYfor Y data). 


test pins 


Two pins, TP1-TPO, support system testing. 
These may be used, for example, to place all outputs 
in a 
high-impedance 
state, isolating the chip from the rest of the system (see Table 8). 


TPl 
TPO 
RESULT 
0 
0 
All outputs and liDs forced low 


1 
0 
All outputs and liDs forced high 
0 
1 
All outputs and liDs placed in a high impedance state 


1 
1 
Normal operation (default state I 


instruction set overview 


Bits 17-10 are used as instruction 
inputs to the' ACT8832A. 
Table 9 lists all instructions 
divided into five 
groups with their opcodes and mnemonics. 
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GROUP 1 INSTRUCTIONS 


INSTRUCTION 
BITS 


13-10 
MNEMONIC 
FUNCTION 


(HEX) 


0 
Used to access Group 4 instructions 


1 
ADD 
R + S + Cn 


2 
SUBA 
R + S + Cn 
3 
SUBS 
A + S + Cn 


4 
INCS 
S + Cn 


5 
INCNS 
S+ 
Cn 
6 
INCA 
A + Cn 


7 
INCNA 
R + Cn 
8 
Used 
to 
access 
Group 
3 instructions 
9 
XOA 
A XOA S 


A 
AND 
A AND S 


B 
OA 
A OA S 


C 
NAND 
A NAND S 


D 
NOA 
A NOA S 


E 
ANDNA 
RAND 
S 


F 
Used 
to access 
Group 
5 instructions 


GROUP 2 INSTRUCTIONS 


INSTRUCTION 
BITS 


17-14 
MNEMONIC 
FUNCTION 


(HEX) 
0 
SAA 
Arithmetic 
right single-precision shift 


1 
SAAD 
Arithmetic 
right double-precision shift 


2 
SAL 
Logical right single-precision sljift 
3 
SALD 
Logical right double-precision shift 


4 
SLA 
Arithmetic 
left single-precision shift 


5 
SLAD 
Arithmetic 
left double-precision shift 
6 
SLC 
Circular left single-precision shift 


7 
SLCD 
Circular left double-precision shift 
8 
SAC 
Circular 
right 
single-precision 
shift 


9 
SACD 
Circular right double-precision shift 


A 
MQSAA 
Arithmetic 
right shift MQ register 


B 
MQSAL 
Logical right shift MQ register 


C 
MQSLL 
Logical left shift MQ register 
D 
MQSLC 
Circular left shift MQ register 


E 
LOADMQ 
Load MQ register 


F 
PASS 
Pass ALU to Y 
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GROUP 3 INSTRUCTIONS 


INSTRUCTION BITS 


17-10 
MNEMONIC 
FUNCTION 


(HEX) 


08 
SET1 
Set bit 1 


18 
SETO 
Set bit 0 


28 
TB1 
Test bit (one) 


38 
TB 
Test bit (zero) 


48 
ABS 
Absolute value 


58 
SMTC 
Sign magnitude/two's 
complement 


68 
ADDI 
Add immediate 


78 
SUBI 
Subtract immediate 


88 
BADD 
Byte add R to S 


98 
BSUBS 
Byte subtract 
S from R 


A8 
BSUBR 
Byte subtract 
R from S 


B8 
81NCS 
Byte increment S 


C8 
81NCNS 
Byte increment negative S 


D8 
8XOR 
Byte XOR Rand 
S 


E8 
8AND 
Byte AND Rand 
S 


F8 
BOR 
Byte OR Rand 
S 


GROUP 4 INSTRUCTIONS 


INSTRUCTION BITS 


17-10 
MNEMONIC 
FUNCTION 


(HEX) 


00 
CRC 
Cyclic 
redundancy 
character 
accumulation 


10 
SEL 
Select S or R 


20 
SNORM 
Single-length normalize 


30 
DNORM 
Double-length 
normalize 


40 
DIVRF 
Divide 
remainder 
fix 


50 
SDIVQF 
Signed divide quotient fix 


60 
SMUll 
Signed multiply 
iterate 


70 
SMULT 
Signed multiply terminate 


80 
SDIVIN 
Signed 
divide 
initialize 


90 
SDIVIS 
Signed divide start 


AO 
SDIVI 
Signed divide iterate 


BO 
UDIVIS 
Unsigned divide start 


CO 
UDIVI 
Unsigned divide iterate 


DO 
UMULI 
Unsigned multiply iterate 


EO 
SDIVIT 
Signed divide terminate 


FO 
UDIVIT 
Unsigned divide terminate 
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GROUP 5 INSTRUCTIONS 


INSTRUCTION BITS 


17-10 
MNEMONIC 
FUNCTION 


(HEX) 


OF 
LOADFF 
Load divide/BCD flip-flops 


1F 
CLR 
Clear 


2F 
CLR 
Clear 


3F 
CLR 
Clear 


4F 
CLR 
Clear 


5F 
DUMPFF 
Output divide/BCD flip-flops 


6F 
CLR 
Clear 


7F 
BCDBIN 
BCD to binary 


SF 
EX3BC 
Excess-3 byte correction 


9F 
EX3C 
Excess-3 
word 
correction 


AF 
SDIVO 
Signed divide overflow test 


BF 
CLR 
Clear 


CF 
CLR 
Clear 


DF 
BINEX3 
Binary 
to excess·3 


EF 
CLR 
Clear 


FF 
NOP 
No 
operation 


Group 1, a set of ALU arithmetic 
and logic operations, 
can be combined 
with the user-selected 
shift 
operations 
in Group 2 in one instruction 
cycle. The other groups contain instructions 
for bit and byte 
operations, division and multiplication, 
data conversion, and other functions such as sorting, normalization 
and polynomial 
code accumulation. 


arithmetic/logic 
instructions 
with shifts 


The seven Group 1 arithmetic instructions 
operate on data from the Rand/or S multiplexers and the carry- 
in. Carry-out is evaluated after ALU operation; 
other status pins are evaluated after the accompanying 
shift operation, when applicable. Group 1 logic instructions do not use carry-in; carry-out is forced to zero. 


Possible shift instructions are listed in Group 2. Fourteen single and double precision shifts can be specified, 
or the ALU result can be passed unshifted to the MQ register or to the specified output 
destination 
by 
using the LOADMQ or PASS instructions. 
Table 10 lists shift definitions. 


SHIFT TYPE 
NOTES 


Left 
Moves a bit one position towards the most significant 
bit 


Right 
Moves a bit one position towards the least significant bit 


Arithmetic 
right 
Retains 
the sign unless 
an overflow 
occurs, 
in which 
case, 
the 


sign would be inverted 


Arithmetic 
left 
May lose the sign bit if an overflow occurs. Zero is filled into 


the least significant 
bit unless the bit is set externally 


Circular right 
Fills the least significant bit in the most significant bit position 


Circular left 
Fills the most significant bit in the least significant bit position 


Logical right 
Fills a zero in the most significant bit position unless the bit 


is forced to one by placing a zero on an 510 pin 


Logical left 
Fills a zero in the least significant bit position unless the bit 


is forced to one by placing a zero on an 510 pin 
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When using the shift registers for double-precision 
operations, 
the least significant 
half should be placed 


in the MQ register and the most significant 
half in the ALU for passage to the ALU shifter. 
An example 
of a double-precision 
shift 
using the ALU and MQ shifters 
is given in Figure 5. 


SERIAL DATA 
INPUT SIGNALS 


SERIAL DATA 
INPUT SIGNALS 


All Group 2 shifts 
can be made conditional 
using the conditional 
shift pin (SSF). If the SSF pin is high or 
floating, 
the shifted 
ALU output 
will be sent to the output 
buffers, 
MQ register, 
or both. If the SSF pin 


is pulled low, the ALU result will be passed directly to the output buffers and any MQ shifts will be inhibited. 


The bidirectional 
SIO pins can be used to supply external end fill bits for certain Group 2 shift instructions. 


When SIO is high or floating, 
a zero is filled, otherwise 
a 1 is filled. Table 11 lists instructions 
that make 
use of the SIO inputs and identifies 
input and output 
functions. 
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INSTRUCTION 
SIO 


BITS 17-10 
MNEMONIC 
110 
DATA 
(HEX) 


0- 
SRA 
0 
Shift out 


1 - 
SRAD 
0 
Shift out 


2- 
SRL 
I 
Most significant 
bit 


3- 
SRLD 
I 
Most significant 
bit 
4- 
SLA 
I 
Least significant 
bit 
5- 
SLAD 
I 
Least significant 
bit 


6- 
SLC 
0 
Shifted input to MO shifter 
7- 
SLCD 
0 
Shifted input to MO shifter 


S- 
SRC 
0 
Shifted input to ALU shifter 


9- 
SRCD 
0 
Shifted input to ALU shifter 


A- 
MOSRA 
0 
Shift out 


B- 
MOSRL 
I 
Most significant 
bit 


C- 
MOSLL 
I 
Least significant bit 
0- 
MOSLC 
0 
Shifted input to MO shifter 


00 
CRC 
0 
Internally generated end fill bit 


20 
SNORM 
I 
Least significant 
bit 


30 
DNORM 
I 
Least significant 
bit 


60 
SMUll 
0 
ALUO 


70 
SMULT 
0 
ALUO 


SO 
SDIVIN 
0 
Internally generated end fill bit 


90 
SDIVIS 
0 
Internally generated end fill bit 


AO 
SDIVI 
0 
Internally generated end fill bit 


BO 
UDIVIS 
0 
Internally generated end fill bit 


CO 
UDIVI 
0 
Internally generated end fill bit 


DO 
UMULI 
0 
Internal input 


EO 
SDIVT 
0 
Internally generated end fill bit 


FO 
UDIVIT 
0 
Internally generated end fill bit 


7F 
BCDBIN 
I 
Least significant 
bit 


OF 
BINEX3 
0 
Shifted input to MO register 


other arithmetic instructions 


The' ACT8832A 
supports two immediate 
arithmetic 
operations. 
ADDI and SUBI (Group 3) add or subtract 
a constant 
between the values of 0 and 15 from an operand on the S bus. The constant 
value is specified 
in bits A3-AO. 


Twelve 
Group 
4 instructions 
support 
serial division 
and multiplication. 
Signed. 
unsigned 
and mixed 
multiplication 
are implemented 
using three instructions: 
SMUll, 
which performs 
a signed times unsigned 
iteration; 
SMUL T, which 
provides 
negative 
weighting 
of the sign bit of a negative 
multiplier 
in signed 
multiplication; 
and UMULI, which 
performs 
an unsigned 
multiplication 
iteration. 
Algorithms 
using these 
instructions 
are given in Tables 12, 13, and 14. These include: 
signed multiplication. 
which 
performs 
a 
two's 
complement 
multiplication; 
unsigned multiplication, 
which 
produces 
an unsigned times unsigned 
product; and mixed multiplication 
which multiplies a signed multiplicand by an unsigned multiplier to produce 
a signed result. 
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OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CYCLES 
SPORT 
R PORT 
Y PORT 
CODE 


E4 
LOADMQ 
1 
Multiplier 
- 
Multiplier 


60 
SMUll 
N-1 t 
Accumulator 
Multiplicand 
Partial product 


70 
SMULT 
1 
Accumulator 
Multiplicand 
Product (MSH) * 


OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 


CODE 
CYCLES 
SPORT 
R PORT 
Y PORT 


E4 
LOADMQ 
1 
Multiplier 
- 
Multiplier 


DO 
UMULI 
N-1 t 
Accumulator 
Multiplicand 
Partial product 


DO 
UMULI 
1 
Accumulator 
Multiplicand 
Product (MSH) * 


OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CODE 
CYCLES 
SPORT 
R PORT 
Y PORT 


E4 
LOADMQ 
1 
Multiplier 
- 
Multiplier 


60 
SMUll 
N-1 t 
Accumulator 
Multiplicand 
Partial product 


60 
SMUll 
1 
Accumulator 
Multiplicand 
Product (MSH) * 


tN = 8 for quad 8-bit mode. 16 for dual 16-bit mode. 32 for 32-bit mode. 
*The least significant 
half of the product is in the MQ register. 


Instructions 
that support 
division 
include start, 
iterate, 
and terminate 
instructions 
for unsigned 
division 


routines (UDIVIS, UDIVI and UDIVIT); initialize, start, iterate, and terminate 
instructions 
for signed division 


routines 
(SDIVIN, SDIVIS, SDIVI, and SDIVIT); and correction 
instructions 
for these routines 
(DIVRF and 


SDIVQF). A Group 5 instruction, 
SDIVO, is available for optional 
overflow 
testing. 
Algorithms 
for signed 


and unsigned division are given in Tables 15 and 16. These use a nonrestoring 
technique 
to divide a 16 N- 


bit integer dividend 
by an 8 N-bit integer divisor to produce 
an 8 N-bit integer quotient 
and remainder, 
where 
N = 
1 for quad 8-bit mode, N = 2 for dual 16-bit 
mode, and N = 4 for 32-bit 
mode. 


OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CODE 
CYCLES 
SPORT 
R PORT 
Y PORT 


E4 
LOADMQ 
1 
Dividend (LSH) 
- 
Dividend (LSHI 


80 
SDIVIN 
1 
Dividend IMSH) 
Divisor 
Remainder IN) 


AF 
SDIVO 
1 
Remainder IN) 
Divisor 
Overflow Test 


Result 


90 
SDIVIS 
1 
Remainder (N) 
Divisor 
Remainder (N) 


AO 
SDIVI 
N-2t 
Remainder (N) 
Divisor 
Remainder (N) 


EO 
SDIVIT 
1 
Remainder (N) 
Divisor 
Remainder§ 


40 
DIVRF 
1 
Remainder* 
Divisor 
Remainder' 


50 
SDIVQF 
1 
MQ register 
Divisor 
Quotient # 


tN = 8 for quad 8-bit mode. 16 for dual 16-bit mode. 32 for 32-bit mode. 
*The least significant 
half of the product is in the MQ register. 


§Unfixed 
'Fixed 
(corrected) 
# The quotient is stored in the MQ register. Remainder can be output at the Y port or 
stored in the register file accumulator. 
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OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CYCLES 
SPORT 
R PORT 
Y PORT 
CODE 


E4 
LOADMQ 
1 
Dividend (LSH) 
- 
Dividend (LSH) 


BO 
UDIVIS 
1 
pividend 
(MSH) 
Divisor 
Remainder IN) 


CO 
UDIVI 
N-1 t 
Remainder IN) 
Divisor 
Remainder IN) 


FO 
UDIVIT 
1 
Remainder IN) 
Divisor 
Remainder* 


40 
DIVRF 
1 
Remainder§ 
Divisor 
Remainder§ 


tN = 8 in quad 8-bit mode, 16 in dual 16-bit mode, 32 in 32-bit mode 
*Unfixed 
§Fixed (corrected) 


data conversion instructions 


Conversion 
of binary data to one's and two's 
complement 
can be implemented 
using the INCNR instruction 


(Group 1). SMTC (Group 3) permits conversion 
from two's 
complement 
representation 
to sign magnitude 


representation, 
or vice versa. Two's 
complement 
numbers can be converted 
to their positive 
value, using 


ABS (Group 3). 


SNORM and DNORM (Group 4) provide for normalization 
of signed, single- and double-precision 
data. The 


operand is placed in the MQ register and shifted toward the most significant 
bit until the two most significant 


bits are of opposite 
value. Zeroes are shifted 
into the least significant 
bit, provided 
SIO is high or floating. 


(A low on SID will shift 
a one into the least significant 
bit.) SNORM allows 
the number 
of shifts 
to be 


counted 
and stored 
in one of the register 
files to provide 
the exponent. 


Data stored 
in binary-coded 
decimal form can be converted 
to binary using BCDBIN (Group 5). A routine 


for this conversion, 
given in Table 17, allows the user to convert 
an N-digit BCD number to a 4N-bit 
binary 


number 
in 4N + 8 clock 
cycles. 


OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CODE 
CYCLES 
SPORT 
R PORT 
DESTINAnON 


E4 
LOADMQ 
1 
BCD operand 
- 
MQ reg. 


D2 
SUBR/MQSLC 
1 
Accumulator 
Accumulator 
Accumulator/MQ 
reg. 


D2 
SUBR/MQSLC 
1 
Mask reg. 
Mask reg. 
Mask reg/MQ reg. 


D1 
MQSLC 
2 
Don't care 
Don't care 
MQ reg. 


68 
ADDI (15) 
1 
Accumulator 
Decimal 15 
Mask reg. 


REPEAT N-1 TIMESt 


DA 
AND/MQSLC 
1 
MQ reg. 
Mask reg. 
Interim reg/MQ reg. 


D1 
ADD/MQSLC 
1 
Accumulator 
Interim reg. 
Interim reg/MQ reg. 


7F 
BCDBIN 
1 
Interim reg. 
Interim res. 
Accumulator/MQ 
reg. 


7F 
BCDBIN 
1 
Accumulator 
Interim reg. 
Accumulator/MQ 
reg. 


END REPEAT 


FA I AND 
1 
MQ reg. 
Mask reg. 
Interim reg. 


D1 
ADD MQSLC 
1 
Accumulator 
Interim reg. 
Accumulator 


BINEX3, EX3BC, and EX3C assist binary to excess-3 
conversion. 
Using BINEX3, an N-bit binary number 


can be converted 
to an N/4- digit excess-3 
number. 
For an algorithm, 
see Table 
18. 
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OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
SPORT 
R PORT 
DESTINATION 
CODE 
CYCLES 


E4 
LOADMO 
1 
Binary numbe 
- 
MO reg. 


02 
SUBR 
1 
Accumulator 
Accumulator 
Accumulator 


02 
SET1 133)16 
1 
Accumulator 
Mask 133116 
Accumulator 


REPEAT N TIMES* 


OF I BINEX3 
1 
Accumulator 
Accumulator 
Accumulator/MO 
reg 


9F 
EX3C 
1 
Accumulator 
Internal data 
Accumulator 


END REPEAT 


bit and byte instructions 


Four Group 3 instructions 
allow the user to test or set selected bits within 
a byte. SET1 and SETa force 
selected 
bits of a selected 
byte (or bytes) to one and zero, respectively. 
TB1 and TB test selected 
bits 
of a selected byte (or bytes) for ones and zeros. The bits to be set or tested are specified 
by an 8-bit mask 
formed by the concatentation 
of register file address inputs C3-CO and A3-AO. The register file addressed 
by B5-BO is used as the destination 
operand for the set bit instructions. 
Register writes 
are inhibited 
for 
test bit instructions. 
Bytes to be operated 
on are selected 
by forcing 
Sian 
low, where 
n represents 
the 
byte position 
and a represents 
the least significant 
byte. A high on the zero output 
pin signifies 
that the 
test data matches 
the mask; a low on the zero output 
indicates 
that the test has failed. 


Individual 
bytes of data can also be manipulated 
using eight Group 3 byte arithmetic/logic 
instructions. 
Bytes can be added, subtracted, 
incremented, 
ORed, ANDed, and exclusive ORed. Like the bit instructions, 
bytes are selected 
by forcing 
Sian 
low, but multiple 
bytes can be operated 
on only if they are adjacent 
to one another; 
at least one byte must be nonselected. 


other instructions 


SEL (Group 4) selects one of the ALU's two 
operands, 
S or R, depending 
on the state of the SSF pin. 
This instruction 
could be used in sort routines to select the larger or smaller of two operands by performing 
a subtraction 
and sending the status result to SSF. CRC (Group 4) is designed to verify serial binary data 
that has been transmitted 
over a channel using a cyclic redundancy 
check code. An algorithm 
using this 


instruction 
is given in Table 19. 


OP 
CLOCK 
INPUT 
INPUT 
OUTPUT 
MNEMONIC 
CODE 
CYCLES 
SPORT 
R PORT 
DESTINATION 


E4 
LOADMO 
1 
Vector c'(xl t 
- 
MO reg. 


F6 
INCR 
1 
- 
Polynomial g(xl 
Poly reg. 


F2 
SUBR 
1 
Accumulator 
Accumulator 
Accumulator 


REPEAT n/8N TIMEst 


00 
CRC 
1 
Accumulator 
Poly reg. 
Accumulator 


E4 
LOADMO 
1 
Vector c'(x) t 
- 
MO reg. 


END REPEAT 


t N = Number of bits in binary number 
n = Length of the code vector 


CLR forces the ALU output to zero and clears the internal BCD flip-flops 
used in excess-3 BCD operations. 
Nap 
forces 
the ALU output 
to zero, but does not affect 
the flip-flops. 
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configuration 
options 


The' ACT8832A 
can be configured 
to operate in 8-bit, 
16-bit, 
or 32-bit 
modes, depending 
on the setting 


of the configuration 
mode selects (CF2-CFO). Table 4 shows the control inputs for the four operating modes. 
Selecting an operating 
configuration 
other than 32-bit 
mode affects 
ALU operation 
and status generation 


in several ways, 
depending 
on the mode selected. 


masked 32-bit 
operation 


Masked 32-bit 
operation 
is selected to reset to zero the 20 most significant 
bits of the R Mux input. The 


12 least significant 
bits are unaffected 
by the mask. Only Group 1 and Group 2 instructions 
can be used 


in this operating 
configuration. 
Status generation 
is similar to unmasked 
32-bit 
operating 
mode. 


shift 
instructions 


Shift 
instructions 
operate similarly 
in 8-bit, 
16-bit, and 32-bit 
modes. The serial I/O (5103'-5100') 
pins 
are used to select end-fill bits or to shift bits in or out, depending on the operation being performed. 
Table 6 


shows the 510 signals associated 
with each byte or word in the different 
modes, and Table 11 indicates 


the specific 
function 
performed 
by the 510 pins during shift, 
multiply, 
and divide operations. 


Figures 6 and 7 present examples 
of logical right shifts 
in 16-bit 
and 8-bit configurations. 


SERIAL DATA 
INPUT SIGNALS 


SIOO 


SERIAL DATA 
INPUT SIGNALS 


SIOO 
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SERIAL DATA 
INPUT SIGNALS 


5100 
5101 
5102 


SERIAL DATA 
INPUT SIGNALS 
SiOO • 
5101 
5102 


bit and byte instructions 


The' ACT8832A 
performs 
bit operations 
similarly 
in 8-bit, 
16-bit, 
and 32-bit 
modes. Masks are loaded 


into the R MUX on the A3-AO and C3-CO address inputs, 
and the bytes to be masked are selected 
by 


pulling their 510' 
inputs low. 
Instructions 
that set, reset, or test bits are explained 
later. 


Byte operations should be performed in 32-bit mode to get the necessary status outputs. While byte overflow 
signals are provided for all four bytes (BYOF3-BYOFO), the other status signals (C, N, Z) are output 
only 
for the word selected 
with 
the configuration 
control 
signals (CF2-CFOl. 


------ 
status 
selection 


Status results (C, N, Z, and overflow) 
are internally generated for all words in all modes, but only the overflow 


results 
(BYOF3-BYOFO) are available for all four bytes in 8-bit mode or for both words 
in 16-bit 
mode. 


If a specific 
application 
requires that the four status results are read for two 
or four words, 
it is possible 


to toggle the configuration 
control signals (CF2-CFO) within 
the same clock cycle and read the additional 


status results. This assumes that the necessary external hardware is provided to toggle CF2-CFO and collect 
the status 
for the individual 
words 
before the next clock signal is input. 
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absolute maximum ratings over operating free-air temperature 
range (unless otherwise 
noted) t 


Supply voltage, 
Vcc(see 
Note 1) 
-0.5 V to 6 V 
Input clamp current, 
11K (VI < 0 or VI > VCC) 
. . . . . . . . . . . . . . . . 
± 20 mA 
Output 
clamp current, 
10K 
(VO < 0 or Vo > VCC) 
.... 
± 50 mA 
Continuous 
output 
current, 
10 (VO = 0 to VCC) . . . . . . . 
± 50 mA 
Continuous 
current through 
VCC or GND pins. 
. . . . . . . . 
± 100 mA 
Operating 
free-air temperature 
range. 
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
0 DC to 70 DC 
Storage temperature 
range 
- 65 DC to 150 DC 


lStresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 


only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" 
is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 


NOTE 1: All voltage levels are with respect to GND. 


PARAMETER 
MIN 
NOM 
MAX 
UNIT 


Vee 
Supply voltage 
4.75 
5.0 
5.25 
V 


V,H 
High-level input voltage 
2 
Vee 
V 


V,L 
Low-level input voltage 
0 
0.8 
V 


'OH 
High-level output current 
-8 
mA 


'OL 
Low-level output current 
8 
mA 


V, 
Input voltage 
0 
Vee 
V 


Vo 
Output voltage 
0 
Vee 
V 


dtldv 
Input transition rise or fall rate 
0 
15 
nslV 


TA 
Operating free-air temperature 
0 
70 
°e 


electrical characteristics over recommended ranges of supply voltage and free-air temperature (unless 
otherwise 
noted) 


PARAMETER 
TEST CONDITIONS 
VCC 
MIN 
TYP* 
MAX 
UNIT 


4.75 V 
4.6 
'OH = -20 
p.A 
5.25 V 
5.1 
VOH 
V 
4.75 V 
3.85 
3.95 


'OH = 
-8 
mA 
5.25 V 
4.60 
4.70 


4.75 V 
0.1 


'OL = 20 p.A 
5.25 V 
0.1 
VOL 
V 
4.75 V 
0.32 
0.45 


10L = 8 mA 
5.25 V 
0.32 
0.45 


II§ 
VI = Vee or 0 
5.25 V 
±0.1 
±1 
p.A 


IIESI03-0 
VI = Vee or 0 
5.25 V 
±5 
mA 


leeQ 
V, = Vee or 0 
5.25 V 
100 
200 
p.A 


ei 
V, = Vee or 0 
5V 
10 
pF 


alee' 
One input at 3.4 V, other inputs at 0 or Vee 
5.25 V 
1 
mA 


*AII typical values are at Vee = 5 V, TA = 25°e. 
§For 1/0 ports, the parameter 'OZH and 00ZL include the offstate output current. 
1This is the increase in supply 'current for each input that is at one of the specified TTL voltage levels rather then 0 V to Vee. 
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maximum switching characteristics 
over recommended 
ranges of supply voltage and operating free- 
air temperature 
(unless otherwise 
noted) t 


TO (OUTPUT) 


PARAMETER 
FROM (INPUT) 
PA/B 
UNIT 


Y 
C 
Z 
SIO 
PERRA/B 
N 
OVR 
DA/B 
PY 
PERRY 
MSERR 


A5-AO,B5-BO 
36 
30 
37 
28 
30 
37 
16 
37 


DA31-DAO,PA3-PAO 
36 
25 
37 
25 
20 
28 
37 
37 
DB31-DBO,PB3-PBO 


Cn 
30 
22 
31 
24 
28 
28 
32 


EA 
37 
28 
37 
25 
31 
37 
37 


EB1-EBO 
37 
2B 
37 
25 
31 
37 
37 


17-10 
37 
30 
37 
28 
32 
37 
37 


CF2-CFO 
37 
30 
37 
28 
32 
37 
37 


OEB,OEA 
15 


tpd 
OEY3-0EY0 
ns 
20 
20 


SElMQ 
15 
20 


S103-S100 
15 
25 
25 
27 


ClK 
21 
28 


ClK 
(w/MQ register 
37 
37 
feedback) 


RFClK 
37 
32 
37 
24 
32 
37 
37 


iESi03-iESiOo 
15 
25 
15 
25 
27 


SSF 
25 
30 
22 
30 
22 
30 


Y 
15 
15 


timing requirements over recommended 
ranges of supply voltage and operating free-air temperature 


(unless otherwise 
noted) 


file write 
setup 


PARAMETER 
MIN 
MAX 
UNIT 


C5-CO 
4 


DA/B32-DA/BO, 
PA/B3-PA/BO 
7 


17-10 
13 


0EY3-0EYO 
7 


Y31-YO 
4 


tsu 
WE3-WEO 
4 


SElRFIDA,DB.PA.PB) 
ns 
5 
SElRFIYI 
9 


SIO 
10 


SEl.MQ 
9 


IESI03-iESiOo 
10 


th 
All 
0 
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TIMING 
CLOADt 
IOL 
IOH 
VLOAD 
PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


tpZH 
0 
ten 
tpZL 
50 
8 
-8 
3 


tdls 
tpHZ 
-8 
tpLZ 
50 
8 
1.5 


tcd 
50 
8 
-8 
* 
t CLOAD includes probes and test fixture capacitance. 
* VLOAD - VOL = 50 Q, where VOL = 0.6 V, IOL = 8 mA. 


IOL 
(s-;E~ 
(~~ -'S:- :: 


teu~th 


~ 


---3V 
Det. 
2.7 V 
2.7 V 
1.5 V 
Input 0.3 V 
1.5 V 
D.3 vi . 
0 V 
t,~ 
14- 
~ 
~tf 


VOLTAGE WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


Output 
Control 
(Iow·level 
enabling) 
.----- 
- 
- 
- 
- 
3 V 
Input 
L 1.5 V 
"- 
1.5 V 
(S- 
Note A) --Ii 
~- 
O V 


tpd 
14 
.1 
i+-----+L- tpd 
I 
! 
1- - 
VOH 
In-o"~: 
I 
11.5V I 
~V 
--1--' 
I 
VOL 
I~ 
_. 
t4---+'- 
tpd 
tpd~ 
I 
\L 
~ 
VOH 
'\ 
1.5 V 
T ~.5_V 
VOL 


W.veform1 
(S-NoteB) 


Notes: 
A. Phase relationships between waveforms were chosen arb~rarily. All input pulses are supplied by pulse generators having the following 
characteristics: 
PRR = 1 MHz, Zo = 50 n, tr ~ 6 ns, tf ~ 6 ns. 


B. Waveform 
1 Is for an output w~h internal cond~ions such that the output is low except when disabled by the output control. 


C. Waveform 2 is for an output w~h internal conditions such that the output Is high exoept when disabled by the output control. For tPLZ 
and tPHZ, VOL and VOH are specified values. 
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OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


1 if signed arithmetic overflow 


1 if carry-out 
= 1 


ZERO 


N 


tc is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


EXAMPLE (assumes a 32-bit configuration) 


Add data in register 1 to data in register 10 with carry-in and store the unshifted result 
in register 1. Shift the contents of the MQ register one bit to the right, retaining the 
sign bit. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
0EY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
E'AEBO 
C5·CO 
SELMO 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


10100001 
00 0001 
00 1010 
0 
00 
00 0001 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Assume register file 1 holds 5608C618 
(Hex), register file 10 holds 14007530 
(Hex), 
and MQ register holds 98A99AOE (Hex). 


Source 
0101 01100000 
1000 1100 0110 0001 
1000 I 
R +- RF(1) 


Source 
00010100000000000111010100110000 
I 
S +- RF(10) 


Destination 
0110 
10100000 
1001 0011 
1011 0100 
1001 I 
RF(1) +- R + S + Cn 


Source 
1001 1000 1010 1001 1001 10100000 
1110 I 
MQ shifter 
+- MQ register 


Destination 
1100 1100 0101 0100 
1100 1101 0000 0111 I 
MQ register 
+- MQ shifter 


Passesthe result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a right shift on MQ. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction 
field (13-10)is passed unshifted to Y MUX. 


The contents of the MQ register are shifted one bit to the right. A zero is placed in 
the sign bit of the most significant 
byte unless the SIO input for that byte is set to 
zero; this will force the sign bit to 1. Bit 0 of the least significant 
byte is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 


ALU Shifter 


None 


MQ Shifter 


Logical Right 


RF 
RF 


(C5-CO) 
Y-Port 
(B5-BO) 


Yes 
No 
Yes 


Signal 
User 


Programmable 
Use 


SSF 
Yes 
Passes shift result if high or floating; 
retains MQ 


without 
shift if low. 


~O 
Yes 
Fills a zero in LSB of MQ shifter if high or floating; 


sets LSB to one if low. 
~1 
No 
Inactive in 32-bit configuration; 
used in other 


Si02 
No 
configurations 
to select end-fill in LSBs. 


Si03 
No 


Cn 
No 
Affects 
arithmetic 
operation programmed in bits 


13-10of instruction 
field. 


OVR 


C 


1 if result 
= 0 


1 if MSB of result = 1 
o if MSB of result = 0 


1 if signed arithmetic 
overflow 


1 if carry-out 
= 1 


ZERO 


N 


tc is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 


after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
result in register 1. Shift the contents of the MQ register one bit to the left. 


Instr 
Op,d 
Oprd 
Op,d Sel 
Dest 
Destination Selects 
- 
OEY3- 
Code 
Add' 
Add, 
EB1- 
Add, 
WE3- 
SElRF1- 
CF2- 


17·10 
AS-AO 
B5·80 
E'AE80 
C5-CO 
SELMa 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


1011 0001 
00 0001 
XX XXXX 
0 
10 
00 0001 
0ססoo 
10 
x 
X 
xxxx 
0 
1 
110 


Assume register file 1 holds 5608C618 
(Hex), DB bus holds 14007530 
(Hex), and 


MQ register holds 98A99AOE (Hex). 


Source 
0101 0110ססoo 1000 1100 0110 0001 
1000 I 
R - 
RF(1) 


Source 
0001 0100ססooססoo0111 0101 0011ססoo I 
S - 
DB bus 


Destination 
0110 
1010ססoo 1001 0011 
1011 0100 
1001 I 
RF(1) - 
R + S + Cn 


Source 
1001 1000 1010 1001 1001 1010ססoo 1110 I 
MQ shifter 
- 
MQ register 


Destination 
0100 
1100 0101 0100 
1100 1101ססoo011; I 
MQ register 
- 
MQ shifter 


Data on the R bus is NANDed with data on the S bus. The result appears at the ALU 
and MQ shifters. 


·The result of this instruction 
can be shifted 
in the same microcycle 
by specifying 
a shift 
instruction 
in the 


upper nibble (17-14)of the instruction 
field. The result may also be passed without 
shift. Possible instructions 


are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


(B5-BO) 
DB-Port 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 


(C5-CO) 
(B5-BO) 
Y-Port 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


Signal 
User 


Programmable 
Use 


SSF 
No 
Affect 
shift 
instructions 
programmed 
in bits 17-14 of 


SiDo 
No 
instruction 
field. 


SiD1 
No 


Si02 
No 


Si03 
No 


Cn 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB = 1 
o 
o 


tc is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) 
is evaluated after ALU operation and after shift operation. 


Logically NAND the contents of register 3 and register 5, and store the result 
in register 5. 


Inst. 
Op.d 
Op.d 
Op.d Sel 
Dest 
Destination 
Selects 


Code 
Add. 
Add. 
EB1- 
Add. 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-6O 
EA EBO 
C5-CO 
SELMa 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


1111 
1100 
00 0011 
00 0101 
0 
00 
00 0101 
0ססoo 
10 
x 
X 
XXXX 
0 
X 
110 


Source 
0110ססoo 1111 0110 
1101 
1000 
0100ססoo I 
R +- RF(3) 


This instruction 
forces the ALU output to zero. The BCD flip-flops 
retain their old value. 
Note that the clear instruction 
(CLR) forces the ALU output to zero and clears the BCD 


flip-flops. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 


(B5-BO) 
DB-Port 
Register 


No 
No 
No 


RF 
RF 


(C5-CO) 
(B5-BO) 
Y-Port 


Yes 
No 
Yes 


Status Signals 


I 


ZER~ 
= ~ 


OVR 
= 0 


C 
= 0 


Instr 
Oprd 
Oprd 
Oprd sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
sELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-SO 
EAEao 
C5-CO 
SELMa 
WEO 
sELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


1111 
1111 
XX XXXX 
xx XXXX 
x xx 
00 
1100 
0ססoo 
10 
x 
X 
XXXX 
0 
X 
110 


D~stination 
Iססooססooססooססooססooססooססooססoo I 
RF( 12) 
+- 0 


Data on the R bus is NORed with data on the S bus. The result appears at the ALU 
and MQ shifters. 


·The result of this instruction 
can be shifted in the same microcycle by specifying a shift instruction 
in the 
upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 


(C5-CO) 
Y-Port 
Shifter 
(B5-BO) 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


Signal 
User 


Use 
Programmable 


SSF 
No 
Affect 
shift 
instructions 
programmed 
in bits 17-14 of 


SiDo 
No 
instruction 
field. 


SiD1 
No 


SiD2 
No 


SiD3 
No 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB = 1 
o 
o 


tc is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) 
is evaluated after ALU operation and after shift operation. 


Logically NOR the contents of register 3 and register 5, and store the result 
in register 5. 


Inst, 
Op,d 
Op,d 
Op,d sel 
Dest 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
sELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-80 
EAE80 
C5-CO 
SELMa 
WEO 
sELRFO OEA 
OEB 
OEYO 
OEs 
Cn 
CFO 


1111 1011 
00 0011 
00 0101 
0 
00 
00 0101 
0ססoo 
10 
x 
X 
XXXX 
0 
X 
110 


Source 
01100000 
1111 01101101 
10000100ססoo I 
R - 
RF(3) 


Source 
00010011 
111101101101001101110111 
I 
S - 
RF(5) 


Destination 
1000 1100ססoo 1001 00100100 
1000 1000 I 
RF(5) - 
R NOR S 


Data on the R bus is ORed with data on the S bus. The result appears at the ALU 
and MQ shifters. 


'The 
result of this instruction 
can be shifted 
in the same microcycle 
by specifying 
a shift 
instruction 
in the 
upper nibble (17-14)of the instruction 
field. The result may also be passed without 
shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
Register 
(B5-BO) 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 


(C5-CO) 
Y-Port 
(B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Programmable 
Use 


SSF 
No 
Affect 
shift 
instructions 
programmed 
in bits 17-14 of 


mO 
No 
instruction 
field. 
SiC1 
No 


m2 
No 


m3 
No 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB = 1 
o 
o 


tc is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Logically OR the contents of register 5 and register 3, and store the result in 
register 3. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SElRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-80 
EAEeo 
C5-CO 
SELMa 
WEO 
SElRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


1111 
1011 
00 0101 
00 0011 
0 
00 
00 0011 
0ססoo 
10 
x 
X 
XXXX 
0 
X 
110 


Source 
0110ססoo 1111 
0110 
1101 
1000 
0100ססoo I 
R - 
RF(5) 


Passesthe result of the ALU instruction specified in the lower nibble of the instruction 
field to Y MUX. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction 
field (13-10)is passed unshifted to Y MUX. 


RF 
RF 
ALU 
MQ 


Y-Port 


(C5-CO) 
(B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
None 
None 


User 
Signal 
Use 


Programmable 


SSF 
No 
Inactive 


5100 
No 
Inactive 


5101 
No 
Inactive 


5102 
No 
Inactive 


5103 
No 
Inactive 


Cn 
No 
Affects 
arithmetic 
operation specified in bits 13-10of 


instruction 
field. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 
1 if signed arithmetic overflow 


1 if carry-out condition 


tc is ALU carry out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) 
is evaluated after ALU operation and after shift operation. 


Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
result in register 10. 


Inst' 
Op,d 
Op,d 
Op,d Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-9O 
EA E90 
C5-CO 
SELMa 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


11110001 
00 0001 
XX xxx x 
0 
10 
00 1010 
0ססoo 
10 
x 
X 
XXXX 
0 
1 
110 


Source 
10010011ססoo 1000 1100011000011000 
I 
R +- RF(1) 


Source 
00100100ססooססoo 0111 0101 0011ססoo IS+- 
DB bus 


Destination 
10110111ססoo 10010011 
10110100 
1001 I 
RF(10) 
+- R + S + en 


Signed Divide Iterate 


Performs one of N-2 iterations of nonrestoring signed division by a test subtraction 
of the N-bit divisor from the 2N-bit dividend. An algorithm using this instruction 
is 
given in the "Other 
Arithmetic 
Instructions" 
section. 


SDIVI performs a test subtraction of the divisor from the dividend to generate a quotient 
bit. The test subtraction 
passes if the remainder is positive and fails if negative. If 
it fails, the remainder will be corrected during the next instruction. 


SDIVI checks the pass/fail result of the test subtraction from the previous instruction, 
and evaluates 


F •... 
R+S 
F •... R' + S + Cn 


if the test fails 
if the test passes 


A double precision left shift is performed; bit 7 of the most significant byte of the MQ 
shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 of 
the most significant byte of the ALU shifter is lost. The unfixed quotient bit is circulated 
into the least significant 
bit of the MQ shifter. 


The R bus must be loaded with the divisor, the.S bus with the most significant 
half 
of the result of the previous instruction (SDIVI during iteration or SDIVIS at the beginning 
of iteration). The least significant half of the previous result is in the MQ register. Carry- 
in should be programmed high. Overflow occurring during SDIVI is reported to OVR 
at the end of the signed divide routine (after SDIVQF). 


C3-CO 


RF 
A3-AO 
.. 
(A5-AO) 
Immed 
DA-Port 
A3-AO 
Mask 


Yes 
No 
Yes 
No 


RF 
DB-Port 
MQ 


(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 


(C5-COI 
(B5-BO) 


Yes 
No 
Yes 


User 


Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SiOo 
No 
Pass internally 
generated 
end-fill 
bits. 


Si01 
No 
Si02 
No 
Si03 
No 


Cn 
Yes 
Should 
be programmed 
high 


ZERO 
1 if intermediate 
result = 0 


N 
0 


OVR 
0 


C 
1 if carry-out 


Initializes' ACT8832 for nonrestoring signed division by shifting the dividend left and 
internally preserving the sign bit. An algorithm using this instruction 
is given in the 
"Other 
Arithmetic 
Instructions section. 


This instruction prepares for signed divide iteration operations by shifting the dividend 
and storing the sign for future use. 


The preceding instruction should load the MQ reqister with the least significant half 
of the dividend. During SDIVIN, the S bus should be loaded with the most significant 
half of the dividend, and the R bus with the divisor. V-output should be written back 
to the register file for use in the next instruction. 


A double precision logical left shift is performed; bit 7 of the most significant 
byte 
of the MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. 
Bit 7 of the most significant byte of the ALU shifter is lost. The unfixed quotient sign 
bit is shifted into the least significant 
bit of the MQ shifter. 


C3-CO 


RF 
A3-AO 
.. 
(A5-AO) 
Immed 
DA-Port 
A3-AO 
Mask 


Yes 
No 
Yes 
No 


RF 
DB-Port 
MQ 


(B5-BO) 
Register 
Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Use 
Signal 
Programmable 


SSF 
No 
Inactive 
SiOo 
No 
Pass internally 
generated 
end-fill 
bits. 
SiD1 
No 
SiD2 
No 
SiD3 
No 


Cn 
No 
Inactive 


I 


ZER~ 
= ~ if divisor 
= 0 


OVR 
= 0 


Cn 
= 0 


Computes the first quotient bit of nonrestoring signed division. An algorithm using 
this instruction 
is given in the "Other 
Arithmetic 
Instructions" 
section .. 


SDIVIS computes the first quotient bit during nonrestoring signed division by subtracting 
the divisor from the dividend, which was left-shifted during the prior SDIVIN instruction. 
The resulting remainder due to subtraction 
may be negative. If so, the subsequent 
SDIVI instruction 
will restore the remainder during the next subtraction. 


The R bus must be loaded with the divisor and the S bus with the most significant 
half of the remainder. The result on the Y bus should be loaded back into the register 
file for use in the next instruction. The least significant half of the remainder is in the 
MO register. Carry-in should be programmed high. 


A double precision left shift is performed; bit 7 of the most significant 
byte of the 
MO shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 
of the most significant 
byte of the ALU shifter is lost. The unfixed quotient bit is 
circulated into the least significant 
bit of the MO shifter. 


Overflow occurring during SDIVIS is reported to OVR at the end of the signed division 
routine (after SDIVOF). 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


(B5-BOI 
DB-Port 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-COI 
Y-Port 
(B5-BOI 


Yes 
No 
Yes 


Signed Divide Start 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SiOo 
No 
Pass internally 
generated 
end-fill 
bits. 
Si01 
No 


SI02 
No 
Si03 
No 


Cn 
Yes 
Should be programmed 
high. 


ZERO 
1 if intermediate 
result 
= 0 


N 
0 


OVR 
0 


C 
1 if carry-out 


Solves 
the 
final 
quotient 
bit 
during 
nonrestoring 
signed 
division. 
An 
algorithm using this instruction is given in the "Other Arithmetic Instructions" 
section. 


SOIVIT performs the final subtraction 
of the divisor from the remainder during 
nonrestoring signed division. SOIVIT is preceded by N-2 iterations of SOIVI, where 
N is the number of bits in the dividend. 


The R bus must be loaded with the divisor, and the S bus must be loaded with the 
most significant half of the result of the last SOIVI instruction. The least significant 
half lies in the MQ register. The Y bus result must be loaded back into the register 
file for use in the subsequent OIVRFinstruction. Carry-in should be programmed high. 


SOIVIT checks the pass/fail result of the previous instruction's 
test subtraction and 
evaluates; 


Y+-R+S 
Y +- R' + S + Cn 


if the test fails 
if the test passes 


The contents of the MQ register are shifted one bit to the left; the unfixed quotient 
bit is circulated into the least significant bit. 


Overflow during this instruction is reported to OVR at the end of the signed division 
routine (after SOIVQF). 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-CO) 
Y-Port 
(B5-BO) 


Yes 
No 
Yes 


User 


Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SiOo 
No 
Pass internally generated end-fill bits. 
SiD1 
No 
SiD2 
No 
SiD3 
No 


Cn 
Yes 
Should be programmed high 


ZERO 
1 if intermediate result = 0 


N 
0 


OVR 
0 


C 
1 if carry-out 


Tests for overflow 
during nonrestoring 
signed division. 
An algorithm 
using this 
instruction 
is given in the "Other 
Arithmetic 
Instructions section. 


This instruction performs an initial test subtraction of the divisor from the dividend. 
If overflow is detected, it is preserved internally and reported at the end of the divide 
routine (after SOIVOF). If overflow status is ignored, the SOIVO instruction may be 
omitted. 


The divisor must be loaded onto the R bus; the most significant half of the previous 
SOIVIN result must be loaded onto the S bus. The least significant half is in the MO 
register. 


The result on the Y bus should not be stored back into the register file; WE' should 
be programmed high. 


Carry-in should also be programmed high. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
D8-Port 
(85-80) 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-CO) 
Y-Port 
(85-80) 


Yes 
No 
Yes 


User 
Use 
Signal 
Programmable 


SSF 
No 
Inactive 
SiDo 
No 
Inactive 
SiD 1 
1110 
Inactive 
SiD2 
No 
Inactive 
SiD3 
No 
Inactive 


Cn 
Yes 
Should 
be programmed 
high 


ZERO 
1 if divisor 
= 0 
N 
0 


OVR 
0 


C 
1 if carry-out 


Signed Divide Quotient Fix 


Tests the quotient result after nonrestoring signed division and corrects it if necessary. 
An algorithm using this instruction 
is given in the "Other 
Arithmetic 
Instructions" 
section. 


SDIVOF is the final instruction required to compute the quotient of a 2N-bit dividend 
by an N-bit divisor. It corrects the quotient if the signs of the divisor and dividend are 
different and the remainder is nonzero. 


The fix is implemented by incrementing S: 


Y"'S+l 
Y •..S+O 


if a fix is required 
if no fix is required 


The R bus must be loaded with the divisor, and the S bus with the most significant 
half of the result of the preceding DIVRF instruction. 
The least significant half is in 
the MO register. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-CO) 
Y-Port 
(B5-BO) 


Yes 
No 
Yes 


Signed Divide Quotient Fix 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


5100 
No 
Inactive 


5101 
No 
Inactive 


Si02 
No 
Inactive 


5103 
No 
Inactive 


Cn 
Yes 
Should be programmed high 


ZERO 


N 


OVR 


C 


1 if quotient 
= 0 


1 if sign of quotient + 1 
o if sign of quotient + 0 


1 if divide overflow 


1 if carry-out 


Data on the S bus is passed to Y if SSF is programmed high or floating; data on the 
R bus is passed without 
carry to Y if SSF is programmed low. 


C3-CO 


RF 
A3-AO 


\ 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


Available 
S Bus Source 
Operands 
(MSHI 


RF 
MQ 
D8-Port 
(85-80) 
Register 


Yes 
Yes 
Yes 


RF 
RF 


(C5-COI 
(85-801 
Y-Port 


Yes 
No 
Yes 


User 
Signal 
Programmable 
Use 


SSF 
Yes 
Selects 
S if high, R if low. 


SIOO 
No 
Inactive 


SK>1 
No 
Inactive 


SK>2 
No 
Inactive 


SiC3 
No 
Inactive 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB = 1 
o 
o 


Compare the two's 
complement numbers in registers 1 and 3 and store the larger in 
register 5. 


1. Subtract (SUBS) data in register 3 from data in register 1 and pass the result 
to the Y bus. 
2. Perform Select SIR instruction 
and pass result to register 5. 


[This example assumes the SSF is set by the negative status (N) from the previous 
instruction]. 


Inst, 
Op,d 
Op,d 
Op,d Sel 
Dest 
Destination Selects 


Code 
Add, 
Add, 
E81- 
Add, 
WE3- 
SELRF1- 
OEY3- 
CF2- 
17-10 
A5-AO 
85-80 
EA E80 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 0Eii 
OEYO DES 
Cn 
CFO 
11110011 
00 0001 
00 0011 
0 
00 
XX XXXX 
0 
XXXX 
XX 
X 
Xססoo 
0 
1 
110 
0001ססoo 
00 0001 
00 0011 
0 
00 
00 0101 
0ססoo 
10 
X 
X 
XXXX 
0 
0 
110 


Assume register file 1 holds 00849700 
(Hex) and register file 3 holds 01 C35250 (Hex). 


Instruction Cycle 1 


Source 
00000000 
10000100 
1001 01111101 
0000 I 
R.- 
RF(l) 


Sourceססoo0001110000110101001001010000 
I 
S'- 
RF(3) 


Destination 
1111 11101100 
00010100 
0101 1000ססoo I 
Y bus .- R + S' + Cn 


Source Iססoo 0000 1000 0100 1001 0111 1101ססoo I 
R.- 
RF(1) 


Q 
SSF'- 1 


Sourceססoo 0001 1100 0011 0101 00100101ססoo I 
S'- 
RF(3) 


Destinationססoo 0001 1100 0011 0101 00100101 
0000 I 
RF(5)'- 
S 


The register addressed by B5-BOis both the source and destination for this instruction. 
The source word is passed on the S bus to the ALU, where it is compared to an a-bit 
mask, 
consisting 
of 
a concatenation 
of the 
C3-CO and A3-AO 
address ports 
(C3-CO::A3-AOI. The mask is input via the R bus. All bits in the source word that are 
in the same bit position as ones in the mask are reset. Bytes with their SIO inputs 
programmed 
low perform the Reset Bit instruction. 
Bytes with 
their SIO inputs 
programmed high or floating pass S unaltered. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 


(AS-AD) 
Immed 
A3-AO 


Mask 


No 
No 
No 
Yes 


Available 
S Bus Source 


Operands 
IMSH) 


RF 
MQ 
DB-Port 
(BS-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 


ICS-CO) 
(BS-BO) 


No 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


5100 
No 
Byte-select 


5101 
No 
Byte-select 


5102 
No 
Byte-select 


5103 
No 
Byte-select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) 
o 
o 
o 


EXAMPLE (assumes 
a 32-bit configuration) 


Set bits 3-0 of bytes 1 and 2 of register file 8 to zero and store the result back in 
register 8. 


Instr 
Mask 
Op,d 
Op,d Sel 
Mask 
Destination 
Selects 


Code 
ILSHI 
Add, 
EB1- 
IMSHI 
WE3- SELAF1- 
OEY3- 
CF2- 
5103- 
IESI03- 


17-10 
A3-AO 
B5-BO 
EA EBO 
C3-CO 
SELMa 
WEO 
SELAFO OEA OEB 
OEYO OES Cn CFO 
5100 
IESIOO 


0001 
1000 
1111 
00 1000 
X 
00ססoo 
0ססoo 
10 
x 
X 
XXXX 
0 
X 
110 
1001ססoo 


Destination 
1010ססoo1000ססoo1011 
0000 
1011 
1110 I 
RF(8)n +- Fn or Snt 


tF = ALU result 
n = nth byte 
Register file 8 gets F if byte selected. S if byte not selected. 


The register addressed by B5-BOis both the source and destination for this instruction. 
The source word is passed on the S bus to the ALU, where it is compared to an a-bit 
mask, 
consisting 
of 
a concatenation 
of the 
C3-CO and A3-AO 
address 
ports 
(C3-CO::A3-AOl. The mask is input via the R bus. All bits in the source word that are 
in the same bit position as ones in the mask are forced to a logical one. Bytes with 
their SIO inputs programmed low perform the Set Bit instruction. 
Bytes with their 
SIO inputs programmed high or floating pass S unaltered. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
Yes 


Available 
S Bus Source 
Operands 
(MSH) 


RF 
MQ 


DB-Port 
(B5-BOI 
Register 


Yes 
Yes 
Yes 


RF 
RF 


(C5-COI 
Y-Port 
(B5-BOI 


No 
Yes 
Yes 


Signal 
User 


Programmable 
Use 


SSF 
No 
Inactive 


SiDo 
Yes 
Byte-select 
SiD1 
No 
Byte-select 
SiD2 
No 
Byte-select 


SI03 
No 
Byte-select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) 
o 
o 
o 


EXAMPLE (assumes a 32-bit configuration) 


Set bits 3-0 of byte 1 of register file 1 to zero and store the result back in register 1. 


Instr 
Mask 
Op,d 
Op,d 
S.I 
Mask 
Destination 
Selects 


Coda 
ILSHI 
Add, 
EB1- 
IMSHI 
WE3- 
SELRF1- 
OEY3- 
CF2- 
S103- 
IESI03- 


17-10 
A3-AO 
B5-80 
EA EBO 
C3-CO 
SELMa 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 
IESIOO 
ססoo 1000 
1111 
00 
0001 
X 
00ססoo 
0ססoo 
10 
x 
X 
xxx x 
0 
X 
110 
1101ססoo 


t F = ALU result 
n = nth byte 
Register file 1 gets F if byte selected. S if byte not selected. 


Arithmetic Left Single Precision Shift 


Performs arithmetic left shift on result of ALU operation specified in lower nibble of 
instruction field. 


The result of the ALU operation specified in instruction bits 13-10is shifted one bit 
to the left. A zero is filled into bit 0 of the least significant byte of each word unless 
the SIO input is programmed low; this will force bit 0 to one. Bit 7 is dropped from 
the most significant byte in each word, which may be 1, 2, or 4 bytes long, depending 
on the configuration selected. 


The shift may be made conditional on SSF. If SSf is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 


ALU Shifter 


Arithmetic 
Left 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result if low. 


SIOO 
Yes 
Fills a zero in LSB of each word if high; fills a 


SI01 
Yes 
one in LSB if low. 


SI02 
Yes 


SI03 
Yes 


Cn 
No 
Affects 
arithmetic operation programmed in bits 


13-10of instruction field. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


if signed arithmetic overflow or if MSB XOR MSB-1 


if carry-out condition 


tc is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform the computation A = 2(A + B), where A and B are single-precision. two's 
complement numbers. Let A be stored in register 1 and B be input via the DB bus. 


Instr 
Op.d 
Op,d 
Op,d Sel 
Dest 
Destination Selects 
- 
- 
- 
Code 
Add. 
Add' 
EB1- 
Add' 
WE3- 
SELRF1- 
OEY3- 
CF2- 
5103- 
IESI03- 
- 
- 
SiCo 
iESiOo 
17-10 
AS·AO 
B5-9O 
EA EBO 
C5-CO 
SELMa 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SSF 


0100 
0001 
00 0001 
XX 
XXXX 
0 
10 
00 0001 
0ססoo 
10 
x 
X 
xxx X 
0 
0 
110 
1110ססoo 
1 


Intermediate 
Result 


Performs arithmetic left shift on MQ register (LSH)and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 


The result of the ALU operation specified in instruction bits 13-10is used as the upper 
half of a double-precision word, the contents of the MQ register as the lower half. 


The contents of the MQ register are shifted one bit to the left. A zero is filled into 
bit 0 of the least significant byte of each word unless the SID input for the word is 
set to zero; this will force bit 0 to one. Bit 7 of the most significant byte in the MQ 
shifter is passed to bit 0 of the least significant byte of the ALU shifter. Bit 7 of the 
most significant byte in the ALU shifter is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX and MQ register. If SSF is low, the ALU output and MQ 
register will not be altered. 


ALU Shifter 


Arithmetic 
Left 


MQ Shifter 


Arithmetic 
Left 


RF 
RF 
Y-Port 
(C5-COl 
(B5-BOI 


Yes 
No 
Yes 


Signal 
User 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result if low. 


SIOO 
Yes 
Fills a zero in LSB of each word if high; fills a 
Sial 
Yes 
one in LSB if low. 


SI02 
Yes 


SI03 
Yes 


Cn 
No 
Affects arithmetic operation specified in bits 13-10of 


instruction field. 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


1 if signed arithmetic overflow or if MSB XOR MSB-l 


1 if carry-out condition 


ZERO 


N 


tc is ALU carry-out and is evaluated before shift oper~tion. ZERO and N (negative) are evaluated 


after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform the computation A = 2(A + B), where A and B are two's complement numbers. 
Let A be a double precision number residing in register 1 (MSH) and the MQ register 
(LSH). Let B be a single precision number which is input through the DB bus. 


Instr 
Oprd 
Oprd 
Oprd 5el 
Dest 
Destination Selects 
- 
- 
Code 
Addr 
Addr 
EB1- 
Addr 
WE3· 
SELRF1- 
OEY3- 
CF2- 
S103- 
IESI03- 
- 
- 
- 
OEB 
17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEYO 
OES 
Cn 
CFO 
SIOO 
IESIOO 
SSF 


01010001 
00 0001 
XX 
XXXX 
0 
10 
00 0001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
, "0 
0000 
I 


Assume register file 1 holds 2408C618 
(Hex), DB bus holds 26007530 
(Hex), and 


MQ register holds 50A99AOE (Hex). 


MSH 


Intermediate 
Result 


Circular Left Single Precision Shift 


Performs circular left shift on result of ALU operation specified in lower nibble of 
instruction field. 


The result of the ALU operation specified in instruction bits 13-10is rotated one bit 
to the left. Bit 7 of the most significant byte in each word is passed to bit 0 of the 
least significant byte in the word, which may be 1, 2, or 4 bytes long. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to Y MUX. If SSF is low, F is passed unaltered. 


ALU Shifter 


Circular Left 


MQ Shifter 


None 


RF 
RF 
Y-Port 
(C5-CO) 
{B5-BO} 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result if low. 


SIOO 
No 
Bit 7 of ALU result 


SI01 
No 
Bit 15 of ALU result 


Si02 
No 
Bit 23 of ALU result 


Si03 
No 
Bit 31 of ALU result 


Cn 
No 
Affects 
arithmetic operation specified in bits 13-10of 


instruction 
field. 


Circular Left Single Precision Shift 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


if signed arithmetic overflow 


if carry-out condition 


tc is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO 6ES 
Cn 
CFO 
SSF 


01100110 
000110 
XX XXXX 
0 
00 
000001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1 


Intermediate 
Result 


Performs circular left shift on MQ register (LSH)and result of ALU operation specified 
in lower nibble of instruction field (MSH). 


The result of the ALU operation specified in instruction bits 13-10is used as the upper 
half of a double-precision word, the contents of the MQ register as the lower half. 


The contents of the MQ and ALU registers are rotated one bit to the left. Bit 7 of the 
most significant byte in the MQ shifter is passed to bit 0 of the least significant byte 
of the ALU shifter. Bit 7 of the most significant byte is passed to bit 0 of the least 
significant byte in the MQ shifter. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to Y MUX. If SSF is low, F is passed unaltered and the MQ register is 
not changed. 


ALU Shifter 


Circular Left 


MQ Shifter 


Circular Left 


RF 
RF 
Y-Port 
(C5-COI 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result if low. 


SIOO 
No 
Bit 7 of ALU result 


SI01 
No 
Bit 15 of ALU result 


SI02 
No 
Bit 23 of ALU result 


SI03 
No 
Bit 31 of ALU result 


Cn 
No 
Affects arithmetic operation specified in bits 13-10of 


instruction field. 


ZERO 


N 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


OVR 
if signed arithmetic overflow 


C 
if carry-out condition 


t C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform a circular left double precision shift of data in register 6 (MSH) and MQ (LSH), 
and store the result back in register 6 and the MQ register. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO DES 
Cn 
CFO 
SSF 


01110110 
000110 
XX 
XXXX 
0 
00 
000110 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1 


Intermediate 
Result 


Destination 
10100001 
0101 
0011 
0011 
01000001 
1100 I 
MQ register"" 
MQ shift result 


The S bus provides the source word for this instruction. The number is converted by 
inverting S and adding the result to the carry-in, which should be programmed high 
for proper conversion; the sign bit of the result is then inverted. An error condition 
will occur if the source word is a negative zero (negative sign and zero magnitude). 
In this case, SMTC generates a positive zero, and the OVR pin is set high to reflect 
an illegal conversion. 


The sign bit of the selected operand in the most significant byte is tested; if it is high, 
the converted number is passed to the destination. Otherwise the operand is passed 
unaltered. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-COI 
(B5-BO) 


Yes 
No 
Yes 


Sigm Magnitude/Two's Complement 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Inactive 


SIOl 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
Yes 
Should be programmed high for proper conversion 


ZERO 


N 


OVR 


1 if result = 0 


1 if MSB = 1 


1 if input of most significant byte is 80 (Hex) and results in all other 


bytes are 00 (Hex). 
1 if S = 0 


Convert 
the two's 
complement 
number in register 
1 to sign magnitude 
representation 
and store 
the result 
in register 
4. 


Instr 
Dp,d 
Dp,d 
Dp,d Sel 
Dest 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
DEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO DEA 
DEB 
DEYO 
DES 
Cn 
CFO 


0101 
1000 
XX 
XXXX 
000001 
X 
00 
000100 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Source 
11000011111101101101100001000000 
IS+- 
RF(1) 


Destination 
1011 110000001001 
00100111 
11000000 I 
RF(4) +- S' + Cn 


Source 
01010101000010010010011111000000 
IS+- 
RF(l) 


Destination 
0101 0101 00001001 
00100111 
11000000 I 
RF(4) +- S 


Computes one of N-1 signed or N mixed multiplication iterations for computing an 
N-bit by N-bit product. Algorithms for signed and mixed multiplication 
using this 
instruction are given in the "Other Arithmetic 
Instructions" 
section. 


SMUll checks to determine whether the multiplicand should be added with the present 
partial product. The instruction evaluates: 


F +- R + S + Cn 
F+-S 


if the addition is required 
if no addition is required 


A double precision right shift is performed. Bit 0 of the least significant byte of the 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter; carry-out 
is passed to the most significant bit of the ALU shifter. 


The S bus should be loaded with the contents of an accumulator and the R bus with 
the multiplicand. The Y bus result should be written back to the accumulator after 
each iteration of UMULI. The accumulator should be clearedand the MQ register loaded 
with the multiplier before the first iteration. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BOI 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-COI 
Y-Port 
(B5-BO) 


Yes 
No 
No 


User 
Signal 
Use 


Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Passes LSB from ALU shifter to MSB of MQ shifter. 


SIOl 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Should be programmed low 


ZERO 
1 if result = 0 


N 
1 if MSB = 1 


OVR 
0 


C 
1 if carry-out 


Signed Multiply.Terminate 


Performs the final iteration for computing an N-bit by N-bit signed product. An algorithm 
for signed multiplication 
using this instruction 
is given in the "other 
Arithmetic 
Instructions" 
section. 


SMUll 
checks the present multiplier. bit (the least significant bit of the MQ register) 


to determine whether the multiplicand should be added with the present partial product. 
The instruction 
evaluates: 


F +- R' + S + en 
F+-S 


if the addition is required 
if no addition is required 


with the correct sign in the product. 


A double precision right shift is performed. Bit 0 of the least significant byte of the 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter. 


The S bus should be loaded with the contents of an register file holding the previous 
iteration result; the Rbus must be loaded with the multiplicand. After executing SMULT, 
the Y bus contains the most significant half of the product, and MQ contains the least 
significant 
half. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BOI 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
No 


Signed Multiply 
Terminate 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Passes LSB from ALU shifter to MSB of MQ shifter. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Should be programmed high 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB = 1 
o 


Single-Length Normalize 


Tests the two most significant bits of the MQ register. If they are the same, shifts 
the number to the left. 


This instruction is used to normalize a two's complement number in the MQ register 
by shifting the number one bit position to the left and filling a zero into the LSB (unless 
the SID input for that word is low). Data on the S bus is added to the carry, permitting 
the number of shifts performed to be counted and stored in one of the register files. 


The shift and the S bus increment are inhibited whenever normalization is attempted 
on a number already normalized. Normalization is complete when overflow occurs. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
No 
No 


Available Destination Operands 


(Count) 


Shift Operations 


(Conditional) 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


Single-Length Normalize 


User 


Signal 
Programmable 
Use 


SSF 
No 
Inactive 


SIOO 
No 
Passes internally generated end-fill bit. 


SIOl 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Increments S bus (shift count) if set to one. 


ZERO 


N 


OVR 


C 


if result = 0 


if MSB of MQ register = 1 


if MSB of MQ register XOR 2nd MSB 


if carry-out 
= 1 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
0EY3- 
CF2- 


17-10 
AS-AD 
BS-BO 
EA EBO 
CS-CO 
SELMQ 
WED 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


00100000 
XX 
XXXX 
000011 
X 
00 
000011 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Assume register file 3 holds 00000003 
(Hex) and MQ register holds 3699D84E (Hex). 


Operand 


Source 
0011 01101001 
1001 1101 10000100 
1110 I 
MQ shifter +- MQ register 


Destination 
01101101 
0011 0011 1011 0000 
1001 1100 I 
MQ register +- MQ shifter 


Source 
00000000000000000000000000000011 
IS+- 
RF(3) 


Destination 
00000000000000000000000000000100 
I 
RF(3) +- S + Cn 


Arithmetic Right Single Precision Shift 


Performs arithmetic right shift on result of ALU operation specified in lower nibble of 
instruction field. 


The result of the ALU operation specified in instruction bits 13-10is shifted one bit 
to the right. The sign bit of the most significant byte is retained unless it is inverted 
as a result of overflow. Bit 0 of the least significant byte is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 


ALU Shifter 


Arithmetic 
Right 


MQ Shifter 


None 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shifted output if high; passes ALU result 


if low. 
SiCo 
No 
LSB is shifted out from each word, which may be 


SI01 
No 
1, 2, or 4 bytes long depending on selected 


SI02 
No 
configuration 
SI03 
No 


Cn 
No 
Affects arithmetic operation specified in bits 13-10of 


instruction field. 


ZERO 


N 


1 if result = 0 


1 if MSB of result 
o if MSB of result 


OVR 
0 
C 
1 if carry-out condition 


t C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform the computation A = (A + 8)/2, where A and 8 are single-precision numbers. 
Let A reside in register 1 and 8 be input via the D8 bus. 


Instr 
Oprd 
Op,d 
Op,d Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO OES 
Cn 
CFO 
SSF 


00000001 
000001 
XX 
XXXX 
0 
10 
000001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1 


Intermediate ~ 


Result 


tAfter the intermediate operation (ADD), overflow has occurred and OVR status signal is set high. When the 
arithmetic right shift is executed. the sign bit is corrected (see Table 16 for shift definition notes). 


Performs arithmetic right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 


The result of the ALU operation specified in instruction bits 13-10is used as the upper 
half of a double precision word, the contents of the MQregister 
as the lower half. 


The contents of the ALU are shifted one bit to the right. The sign bit of the most 
significant byte is retained unless the sign bit is inverted as a result of overflow. Bit 0 
of the least significant byte in the ALU shifter is passed to bit 7 of the most significant 
byte of the MQ register. Bit 0 of the MQ register's least significant byte is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 


ALU Shifter 
MQ Shifter 


Arithmetic 
Right Arithmetic 
Right 


RF 
RF 


(C5-COI 
Y-Port 
(B5-BOl 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shifted output if high; passes ALU result 


if low. 


SIOO 
No 
LSB of ALU shifter is passed to MSB of MQ shifter, 


SI01 
No 
and LSB of MQ shifter is dropped. 


SI02 
No 


SI03 
No 


Cn 
No 
Affects 
arithmetic operation specified in bits 13-10of 


instruction field. 


ZERO 


N 


1 if result 
= 0 


1 if MSB of result 
o if MSB of result 


OVR 
0 


C 
1 if carry-out condition 


tc is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Performthe computation A = (A+ B)/2, where A and Baretwo's complement numbers. 
Let A be a double precision number residing in register 1 (MSH) and MQ (LSH). Let 
B be a single precision number which is input through the DB bus. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- SELRF1- 
0EY3- 
CF2- 
17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ WEO SELRFO OEA OEB OEYO OES 
Cn CFO 
SSF 
00010001 
000001 
XX 
XXXX 
0 
10 
000001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1 


Assume register file 1 holds 4A08C618 
(Hex). and DB bus holds 51007530 
(Hex). 


and MQ register holds 17299AOF (Hex). 


Source 
01001010 
00001000 
110001100001 
1000 
R +- RF(1) 


Source 
0101 0001 000000000111 
0101 0011 0000 
S +- DB bus 


Intermediate :I: 


Result 
1001 1011 00001001 
0011 1011 01001000 
ALU Shifter 
+- R + S + Cn 


Destination 
01001101 
100001001001 
1101 10100100 
RF(1) +- ALU shift result 


LSH 


Source 
0001 0111 00101001 
1001 101000001111 
MQ shifter 
+- MQ register 


Destination 
00001011 
1001 0100 1100 1101 00000111 
MQ register 
+- MQ shift result 


tAfter the intermediate operation (ADD). overflow has occurred and OVR status signal is set high. When the 
arithmetic right shift is executed, the sign bit is corrected (see Table 16 for shift definition notes). 


Performs circular right shift on result of ALU operation specified in lower nibble of 
instruction field. 


The result of the ALU operation specified in instruction bits 13-10is shifted one bit 
to the right. Bit 0 of the least significant byte is passedto bit 7 of the most significant 
byte in the same word, which may be 1, 2, or 4 bytes long depending on the selected 
configuration. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 


ALU Shifter 


Circular Right 


MQ Shifter 


None 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result 


if low. 


SIOO 
No 
Rotates LSB to MSB of the same word, which may 


SI01 
No 
be 1, 2, or 4 bytes long depending on configuration 


SI02 
No 


SI03 
No 


Cn 
No 
Affects 
arithmetic operation specified in bits 13-10of 


instruction 
field. 


Circular Right Single Precision Shift 


ZERO 


N 


1 if result = 0 


1 if MSB of result 
o if MSB of result 


OVR 
if signed arithmetic overflow 


C 
if carry-out condition 


t C is ALU carry-out 
and is evaluated 
before 
shift 
operation. 
ZERO and N (negative) 
are evaluated 
after 
shift 
operation. 
OVR (overflow) 
is evaluated 
after 
ALU operation 
and after 
shift 
operation. 


Instr 
Op,d 
Op,d 
Op,d Sel 
Oes! 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1· 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO OES 
Cn 
CFO 
SSF 


10000110 
000110 
XX 
XXXX 
0 
XX 
000001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1 


Intermediate 
Result 


Performs circular right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 


The result of the ALU operation specified in instruction bits 13-10is used as the upper 
half of a double precision word, the contents of the MQ register as the lower half. 


The contents of the ALU and MQ shifters are rotated one bit to the right. Bit 0 of the 
least significant byte in the ALU shifter is passed to bit 7 of the most significant byte 
of the MQ shifter. Bit 0 of the least significant byte is passed to bit 7 of the most 
significant byte of the ALU shifter. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MXU and MQ register. If SSFis low, the Y MUX and MQ register 
will not be altered. 


ALU Shifter 


Circular Right 


MQ Shifter 


Circular Right 


RF 
RF 
Y-Port 
(C5-COI 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result and 


retains MQ register if low. 


SIOO 
No 
Rotates LSB of ALU shifter to MSB of MQ shifter, 


SIOl 
No 
and LSB of MQ shifter to MSB of ALU shifter 


SI02 
No 


SI03 
No 


Cn 
No 
Affects 
arithmetic operation specified in bits 13-10of 


instruction field. 


Circular Right Double Precision Shift 


ZERO 
1 if result = 0 


N 
1 if MSB of result = 1 
o if MSB of result = 0 


OVR 
1 if signed arithmetic overflow 


C 
1 if carry-out condition 


t C is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform a circular right double precision shift of the data in register 6 (MSH) and MQ 
(LSHI. and store the result back in register 6 and the MQ register. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
0EY3- 
CF2- 


17-10 
A5-AO 
B5·BO 
EAEBO 
C5-CO 
SELMO 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


10010110 
00 0110 
XX 
XXXX 
0 
XX 
00 0110 
0 
0000 
10 
X 
X 
XXX X 
0 
0 
110 


Intermediate 
Result 


Logical Right Single Precision Shift 


Performs logical right shift on result of ALU operation specified in lower nibble of 
instruction field. 


The result of the ALU operation specified in instruction bits 13-10is shifted one bit 
to the right. A zero is placed in the bit 7 of the most significant byte of each word 
unless the SIO input for the word is programmed low; this will force the sign bit to 
one. The LSBis dropped from the word, which may be 1, 2, or 4 bytes long depending 
on selected configuration. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
'will be sent to the Y MUX. If SSF is low, the ALU result will be passed unshifted to 
the Y MUX. 


ALU Shifter 


Logical Right 


MQ Shifter 


None 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Passes shift result if high or floating; 
passes ALU 


result if low. 


SIOO 
Yes 
Fills a zero in MSB of the word if high or floating; 


SI01 
Yes 
fills a one in MSB if low. 


SI02 
Yes 


SI03 
Yes 


Cn 
Inactive 


ten 
is ALU carry-out 
and is evaluated 
before 
shift 
operation. 
ZERO and N (negative) 
are evaluated 
after 
shift 
operation. 
OVR (overflow) 
is evaluated 
after 
ALU operation 
and after 
shift 
operation. 


Perform 
a logical right single precision 
shift on data on the DA bus, and store the result 


in register 
1. 


Instr 
Oprd 
Oprd 
Oprd 5el 
Dest 
DestInation Selects 
- 
- 
- - 
Code 
Addr 
Add' 
EBI- 
Add' 
WE3- 
SELAF 1- 
OEY3- 
CF2- 
S103- 
IESI03- 
- 
- 
- - - - 
- 
17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMa 
WEO 
SELAFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 
IESIOO 
SSF 


00100110 
XX xxxx 
xx 
XXXX 
1 
xx 
00 0001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
XXXI 
0000 
1 


Intermediate 
Result 


Destination 
I 0001 01101101 010001100011 
00001010 I 
RF(1) 
+-- ALU shift result 


Logical Right Double Precision Shift 


Performs logical right shift on MQ register (LSH) and result of ALU operation (MSH) 
specified in lower nibble of instruction field. 


The result of the ALU operation specified in instruction bits 13-10is used as the upper 
half of a double precision word, the contents of the MQ register as the lower half. 


The ALU result is shifted one bit to the right. A zero is placed in the sign bit of the 
most significant byte unless the SIO input for that word is programmed low; this will 
force the sign bit to one. Bit 0 of the least significant byte is passed to bit 7 of the 
most significant byte of the MQ shifter. Bit 0 of the least significant byte of the MQ 
shifter is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the Y MUX and MQ register. If SSF is low, the ALU result and MQ 
register will not be altered. 


ALU Shifter 


Logical Right 


MQ Shifter 


Logical Right 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high; passes ALU result and 


retains MQ 


SIOO 
Yes 
Fills a zero in MSB if high or floating; 


SI01 
Yes 
fills a one MSB if low. 


SI02 
Yes 
Si03 
Yes 


Cn 
No 
Affects arithmetic operation specified in bits 13-10of 


instruction field. 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


if signed arithmetic overflow 


if carry-out condition 


ZERO 


N 


t C is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Perform a logical right double precision shift of the data in register 1 (MSH) and MQ 
(LSH). filling a one into the most significant bit, and store the result back in register 1 
and the MQ register. 


Instr 
Oprd 
Op'd 
Op,d 
Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1· 
Add' 
WE3- 
SELRF 1- 
OEY3· 
CF2- 
S103· 
IESI03- 
- 
- - 
17·10 
A5·AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 
IESIOO 


00110110 
XX XXXX 
00 
0001 
X 
00 
00 
0001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1110ססoo 


Intermediate 
Result 


Immediate data in the range 0 to 15, supplied by the user at A3-AO, is inverted and 
added with carry to S. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
Yes 
No 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 


Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Inactive 


SI01 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
Yes 
Two's 
complement 
subtraction 
if programmed 
high. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB = 1 


1 if arithmetic signed overflow 


1 if carry-out 


Insl, 
Dp,d 
Dp,d 
Dp,d Sel 
Desl 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
DEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
DEA 
DEB 
DEYO 
DES 
Cn 
CFO 


0111 
1000 
001100 
XX 
XXXX 
X 
10 
000001 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Data on the R bus is subtracted with carry from data on the S bus. The result appears 
at the ALU and MQ shifters . 


• The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble 07-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed in bits 17-14of 


SIOO 
No 
instruction 
field. 


SIOl 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Two's 
complement 
subtraction 
if programmed high. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB 
= 1 


1 if signed arithmetic overflow 


1 if carry-out 


t C is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated after shift 
operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Subtract data in register 1 from data on the DB bus, and store the result in the MQ 
register. 


Instr 
Oprd 
Op,d 
Op,d Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


11100010 
000001 
XX XXXX 
0 
10 
XX XXXX 
1 
XXXX 
XX 
X 
X 
XXXX 
0 
1 
110 


Subtract S with Carry (R + S' + Cn) 


Data on the S bus is subtracted with carry from data on the R bus. The result appears 
at the ALU and MQ shifters . 


•The result of this instruction can be shifted in the same microcycie by specifying a shift instruction in the 
upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
D6-Port 
(65-60) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (65-60) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed in bits 17-14of 


SIOO 
No 
instruction 
field. 


SIOl 
No 


SI02 
No 


Si03 
No 


Cn 
Yes 
Two's 
complement subtraction 
if programmed high. 


Subtract S with Carry (R + S' + Cn) 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB 
= 1 


1 if signed arithmetic overflow 


1 if carry-out 


tc is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Subtract data on the DB bus from data in register 1, and store the result in the MQ 
register. 


Inst, 
Op,d 
Op,d 
Op,d Sel 
Dest 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRFl - 
OEY3· 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


11100011 
000001 
XX 
XXXX 
0 
10 
XX 
XXX 
X 
1 
XXXX 
XX 
X 
X 
XXXX 
0 
1 
110 


The 5 bus is the source word for this instruction. The source word is passed to the 
ALU, where it is compared to an 8-bit mask, consisting of a concatenation of the C3-CO 
and A3-AO address ports (C3-~O::A3-AO). 
The mask is input via the R bus. The test 
will pass if the selected byte has zeros at all bit locations specified by the ones of 
the mask. Bytes are selected by programming the 510 inputs low. Test results are 
indicated on the ZERO output, which goes to one if the test passes. Register write 
is internally disabled during this instruction. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
Yes 


RF 
MQ 
DB-Port 
(B5-BOl 
Register 


Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte Select 


SiD 1 
Yes 
Byte Select 


SiD2 
Yes 
Byte Select 


SiD3 
Yes 
Byte Select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 
o 
o 
o 


Instr 
Mask 
Oprd 
Oprd 
Sel 
Mask 
Destination 
Selects 


Code 
ILSH) 
Addr 
EB1- 
(MSH) 
WE3- 
SELRFl 
- 
OEY3- 
CF2- 
S103- 
IESI03- 


EA EBO 
C3-CO 
SELMQ 
- 
OEB 
OEYO 
OES 
iESiOo 
17-10 
A3-AO 
B5-BO 
WEO 
SELRFO 
OEA 
Cn 
CFO 
SIOO 


0011 
1000 
0000 
000011 
X 
00 
1110 
X 
XXXX 
XX 
X 
X 
XXXX 
0 
X 
110 
1010 
0000 


Output Q 
ZERO 
+- 1 


The 5 bus is the source word for this instruction. The source word is passed to the 
ALU, where it is compared to an 8-bit mask, consisting of a concatenation of the C3-CO 
and A3-AO address ports (C3-CO::A3-AO). The mask is input via the R bus. The test 
will pass if the selected byte has ones at all bit locations specified by the ones of the 
mask. Bytes are selected by programming the 510 inputs low. Test results are indicated 
on the ZEROoutput, which goes to one if the test passes. Register write is internally 
disabled for this instruction. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
Yes 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


User 


Signal 
Use 


Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte Select 


SIOl 
Yes 
Byte Select 


SI02 
Yes 
Byte Select 


5103 
Yes 
Byte Select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 


o 
o 
o 


Instr 
Mask 
Oprd 
Oprd 
Sel 
Mask 
Destmation 
Selects 


IMSHI 
WE3- 
OEY3- 
CF2- 
S103- 
-- 
Code 
ILSHI 
Addr 
EB1- 
SELRF1- 
IESI03- 


17-10 
A3-AO 
B5-BO 
EA EBO 
C3-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 
iESiOo 


0010 
1000 
0000 
00 
001 1 
X 
00 
1110 
X 
XXX X 
XX 
X 
X 
XXX X 
a 
X 
110 
1001 
0000 


Source 
I 
1000 1000 0001 1100 1101 0000 0000 0011 I 
Sn 
+-- RF(3)n t 


Output G 
ZERO 
+-- 0 


Unsigned Divide Iterate 


Performs one of N-2 iterations of nonrestoring unsigned division by a test subtraction 
of the N-bit divisor from the 2N-bit dividend. An algorithm using this instruction can 
be found in the "Other Arithmetic Instructions" 
section. 


UDIVIperforms a test subtraction of the divisor from the dividendto generatea quotient 
bit. The test subtraction may passor fail and is corrected in the subsequent instruction 
if it fails. Similarly a failed test from the previous instruction is corrected during 
evaluation 
of 
the 
current 
UDIVI 
instruction 
(see the 
"Other 
Arithmetic 
Instructions" section for more details). 


The R bus must be loaded with the divisor, the S bus with the most significant half 
of the result of the previous instruction (UDIVI during iteration or UDIVIS at the 
beginning of iteration). The least significant half of the previous result is in the MQ 
register. 


UDIVI checks the result of the previous pass/fail test and then evaluates: 


F+-R+S 
F +- R' + S + en 
if the test is failed 
if the test is passed 


A double precision left shift is performed; bit 7 of the most significant byte of the 
MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 
of the most significant byte of the ALU shifter is lost. The unfixed quotient bit is 
circulated into the least significant bit of the MQ shifter. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AOI 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
D8-Port 
(85-80l 
Register 


Yes 
Yes 
No 


Unsigned Divide Iterate 


RF 
RF 
Y-Port 
(C5-COI (65-60) 


Yes 
No 
Yes 


User 


Signal 
Programmable 
Use 


SSF 
No 
Inactive 


5100 
No 
Passes internally generated end-fill bit. 
5101 
No 
Si02 
No 
5103 
No 


Cn 
Yes 
Should be programmed high. 


ZERO 


N 


OVR 


C 


1 if result = 0 
o 
o 


1 if carry-out 


Unsigned Divide Start 


Computes 
the 
first 
quotient 
bit 
of 
nonrestoring 
unsigned 
division. 
An 
algorithm using this instruction is given in the "Other Arithmetic Instructjions" section. 


UDIVIS computes the first quotient bit during nonrestoring unsigned division by 
subtracting the divisor from the dividend. The resulting remainder due to subtraction 
may be negative; the subsequent UDIVI instruction may have to restore the remainder 
during the next operation. 


The R bus must be loaded with the divisor and the S bus with the most significant 
half of the remainder. The result on the Y bus should be loaded back into the register 
file for use in the next instruction. The least significant half of the remainder is in the 
MQ register. 


UDIVIS computes: 


F +- R' + S + Cn 


A double precision left shift is performed; bit 7 of the most significant byte of the 
MQ shifter is transferred to bit 0 of the least significant byte of the ALU shifter. Bit 7 
of the most significant byte of the ALU shifter is lost. The unfixed quotient bit is 
circulated into the least significant bit of the MQ shifter. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOl 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-COl 
Y-Port 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 
5100 
No 
Passes internally generated end-fill bit. 
Si01 
No 
5102 
No 
5103 
No 


Cn 
Yes 
Should be programmed high. 


ZERO 


N 


OVR 


C 


1 if intermediate result 
= 0 
o 


1 if divide overflow 


1 if carry-out 


Unsigned Divide Terminate 


Solves the final quotient bit during nonrestoring unsigned division. An algorithm using 
this instruction is given in the "Other Arithmetic Instructions" 
section. 


UDIVIT performs the final subtraction of the divisor from the remainder during 
nonrestoring signed division. UDIVIT is preceded by N-1 iterations of UDIVI, where 
N is the number of bits in the dividend. 


The R bus must be loaded with the divisor, the S bus must be loaded with the most 
significant half of the result of the last UDIVI instruction. The least significant half 
lies in the MQ register. The Y bus result must be loaded back into the register file for 
use in the subsequent DIVRF instruction. 


UDIVIT checks the results of the previous pass/fail test and evaluates: 


Y-R+S 


Y - 
R' + S + en 
if the test is failed 
if the test is passed 


The contents of the MQ register are shifted one bit to the left; the unfixed quotient 
bit is circulated into the least significant bit. 


C3-CO 


RF 
A3-Ao 
.. 


DA-Port 
.. 


(A5-Ao) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-Bo) 
Register 


Yes 
Yes 
No 


RF 
RF 


(C5-COI 
Y-Port 
(B5-BOl 


Yes 
No 
Yes 


Unsigned Divide Terminate 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Passes internally generated end-fill bit. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Should be programmed high. 


ZERO 


N 


OVR 


C 


1 if intermediate 
result 
0 


o 
o 


1 if carry-out 


Unsigned Multiply 
Iterate 


Performs one of N unsigned multiplication iterations for computing an N-bit by N-bit 
product. An algorithm for unsigned multiplication using this instruction is given in the 
"Other Arithmetic Instructions" 
section. 


UMULI checks to determine whether the multiplicand should be added with the present 
partial product. The instruction evaluates: 


F +- R + S + en 
F+-S 


if the addition is required 
if no addition is required 


A double precision right shift is performed. Bit 0 of the least significant byte of the 
ALU shifter is passed to bit 7 of the most significant byte of the MQ shifter; carry-out 
is passed to the most significant bit of the ALU shifter. 


The S bus should be loaded with the contents of an accumulator and the R bus with 
the multiplicand. The Y bus result should be written back to the accumulator after 
each iteration of UMULI. The accumulator should be clearedand the MQ register loaded 
with the multiplier before the first iteration. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 
(A5-AOl 
Immed 
A3-AO 
Mask 
Yes 
No 
Yes 
No 


RF 
DB-Port 
MQ 


(B5-BO) 
Register 
Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(85-BO) 


Yes 
No 
Yes 


Unsigned Multiply 
Iterate 


User 
Signal 
Use 
Programmable 


SSF 
No 
Holds LSB of MO. 


SIOO 
No 
Passes internal input (shifted bit). 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Should be programmed low. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB = 1 
o 


1 if carry-out 


Data on the R bus is exclusive ORed with data on the S bus. The result appears at 
the ALU and MQ shifters. 


"The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed 
in bits 17-14 of 


SIOO 
No 
instruction 
field. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB 
= 1 
o 
o 


t C is ALU carry-out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 
- 
<5EYJ 
Code 
Addr 
Addr 
EB1- 
Addr 
WEJ- 
SELRF1- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


1111 
1001 
000011 
000101 
0 
00 
000101 
0 
0000 
10 
X 
X 
XXXX 
0 
X 
110 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 
= a 


a 


1 if signed 
arithmetic 
overflow 
(selected 
bytes) 


1 if carry-out 
(most 
significant 
selected 
byte) 
= 1 


Add bytes 1 and 2 of register 3 with carry to the contents of register 1 and store the 
result in register 11. 


Instr 
Op,d 
Op,d 
Op'd 
Sel 
Dest 
Destination 
Selects 


Code 
- 
- 
- 
Add, 
Add, 
EB1· 
Addr 
WE3· 
SELRF 1· 
OEY3· 
CF2· 
5103· 
IESI03· 


C5·CO 
WEO 
- - 
17·10 
A5·AO 
B5·BO 
EA EBO 
SELMQ 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
5100 
IESIOO 


0100 
1000 
00 
0011 
00 
0001 
a 
00 
00 
1011 
a 
0000 
10 
X 
X 
xxxx 
a 
1 
110 
1001 
0000 


Destination 
I 0111 10101001 0001 01001111 0011 1110 I 
RF( 11)n 
•... Fn or Sn t 


t F = ALU 
result 
n = nth 
byte 
Register 
file 
11 gets 
F if byte 
selected, 
S if byte 
not selected. 


Byte AND RAND 
5 (Byte Logical AND RAND 
51 


Bytes with their corresponding SIO signals programmed low compute RAND S. Bytes 
with SIO signals programmed high, pass S unaltered. Multiple bytes can be selected 
only if they are adjacent to one another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Forced low 


SIOO 
Yes 
Byte select 


SI01 
Yes 
Byte select 


SI02 
Yes 
Byte select 


SI03 
Yes 
Byte select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 
o 
o 
o 


Logically AND bytes 1 and 2 of register 3 with input on the DB bus; store the result 
in register 3. 


Instr 
Op,d 
Op,d 
Oprd 
Sel 
Dest 
Destination 
Selects 


Code 
- 
- 


S103- 
IESI03- 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO - 
C5-CO 
SELMQ 
- - 
- 
IESIOO 
EA EBO 
WED 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 


11101000 
00 
DOll 
XX XXXX 
0 
10 
00 
0011 
0 
0000 
10 
X 
X 
XXX X 
0 
X 
110 
1001 
0000 


Assume register file 3 holds 398FBEBE (Hex) and input on the DB port is 4290BFBF 
(Hex): 


tF 
= ALU result 
n = nth byte 
Register 
file 3 gets 
F if byte 
selected. 
S if byte 
not selected. 


This instruction allows the user to convert an N-digit BCD number to a 4N-bit binary 
number in 4(N-1) plus 8 clocks. The instruction sums the Rand S buses with carry. 


A one-bit arithmetic left shift is performed on the ALU output. A zero is filled into bit 0 
of the least significant byte unless SIOOis set low, which would force bit 0 to one. 
Bit 7 of the most significant byte is dropped. 


Simultaneously, the contents of the MQ register are rotated one bit to the left. Bit 
7 of the most significant byte is rotated to bit 0 of the least significant byte. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
No 


User 
Signal 
Use 


Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
If high or floating, 
fills 
a zero in LSB of ALU shifter; 


if low, 
fills 
a one in LSB of ALU shifter. 


SIOl 
No 
Inactive 
in 32-bit 
configuration. 
Used in other 


SI02 
No 
configurations 
to select 
endfill 
in LSBs. 


SI03 
No 


Cn 
Yes 
Should 
be programmed 
low 
for proper 
conversion. 


ZERO 


N 


OVR 


C 


if result 
= 0 


if MSB = 1 


if signed 
arithmetic 
overflow 


if carry-out 
= 1 


The following 
code converts 
an N-digit BCD number to a 4N-bit binary number in 4(N-l ) 
plus 8 clocks. 
This is one possible 
user generated 
algorithm. 
It employs 
the standard 
conversion 
formula 
for a BCD number 
(shown 
here for 32 bits): 


The conversion 
begins with 
the most significant 
BCD digit. 
Addition 
is performed 
in 
radix 
2. 


PSEUDOCODE 


LOADMO 


SUB 


SLCMO 


SLCMO 


ADDI 


BCD to Binary 


NUM 


ACC, ACC, SLCMO 


Repeat N-1 times: 


(N = number of BCD digits) 


(END REPEAT) 


AND 


ADD 


MO, MSK, R1, 
SLCMO 


MO MSK, R1 


ACC, R1, ACC 


Clear accumulator; 
Circular left shift MO. 


Clear mask register; 
Circular left shift MO. 


Extract one digit; 
Circular left shift MO. 


Add extracted digit to 
accumulator, and store result in 
R1; Circular left shift MO. 


Perform BCDBIN instruction, 
and 
store result in accumulator 
[4 x (ACC + 4 x digit)]; 
Circular left shift MO. 


Perform BCDBIN instruction, 
and 


store result in accumulator 
[10 x (ACC + 10 x digit)]; 
Circular left shift MO. 


Fetch last digit. 


Add in last digit and store result 
in accumulator. 


Byte Increment 
Negative 
S with 
Carry 


Bytes with SIOO programmed low compute S' + en. Bytes with SIOO programmed 
high pass S unaltered. Multiple bytes can be selected only if they are adjacent to one 
another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-COl 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte select 


SIOl 
Yes 
Byte select 


SI02 
Yes 
Byte select 


SI03 
Yes 
Byte select 


Cn 
Yes 
Propagates 
through 
nonselected 
bytes; 
increments 


selected 
byte(s) 
if programmed 
high. 


Byte Increment 
Negative 
S with 
Carry 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 
= 0 
o 


1 if signed 
arithmetic 
overflow 
(selected 
bytes) 


1 if carry-out 
(most 
significant 
selected 
byte) 


Invert 
bytes 0 and 1 of register 
3 and add them to the carry (bytes 
2 and 3 are not 
changed). 
Store the result 
in register 
3. 


lnstr 
Op,d 
Oprd 
Oprd 
Sel 
Dest 
Destination 
Selects 


Code 
WE3- 
SELRF1- 
- 
Add, 
Add, 
EB1- 
Add, 
OEY3- 
CF2- 
S103- 
IESI03- 


OEA - 
- 
iEsiOo 
17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 


11001000 
XX 
XXXX 
000001 
X 
00 
000011 
0 
0000 
10 
X 
X 
XXX X 
0 
1 
110 
1100 
0000 


t F = ALU result 
n = nth byte 
Register 
file 3 gets 
F if byte 
selected, 
S if byte 
not selected. 


Byte Increment 
S with 
Carry 


Bytes with 510' inputs programmed low compute 5 + en. Bytes with 510 inputs 
programmed high, pass 5 unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonsselected. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AOl 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 


(C5-COl 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte 
select 


SI01 
Yes 
Byte 
select 


SI02 
Yes 
Byte 
select 


SI03 
Yes 
Byte 
select 


Cn 
Yes 
Propagates 
through 
nonselected 
bytes; 
increments 


selected 
byte(s) 
if programmed 
high. 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) 
0 


o 


1 if signed arithmetic overflow (selected bytes) 


1 if carry-out (most significant selected byte) 


EXAMPLE (assumes 
a 32-bit 
configuration) 


Add bytes 1 and 2 of register 7 to the carry (bytes 0 and 3 are not changed). Store 
the result in register 2. 


Instr 
Op,d 
Op,d 
Op,d 
Sel 
Dest 
Destination Selects 
- 
- 
- -- 
Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF 1- 
OEY3- 
CF2- 
S103- 
IESI03- 
- 
C5-CO 
- - 
17-10 
A5-AO 
B5-BO 
EA EBO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
SIOO 
IESIOO 


1011 
1000 
XX XXXX 
000111 
X 
00 
000010 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 
1100 
0000 


Destination I 
0100 
0000 
1000 
1111 
1011 
1111 
1011 
1110 I 
RF(2)n +-- Fn or Snt 


t F = ALU 
result 
n = nth 
byte 
Register 
file 
11 gets 
F if byte 
selected. 
S if byte 
not 
selected_ 


Binary to Excess-3 


This instruction converts an N-digit binary number to a N/4 digit excess-3 number 
representation in 2N + 3 clocks. The data on the Rand S buses are added to the carry- 
in, which contains the most significant bit of the MQ register. The contents of the 
MQ register are rotated one bit to the left. The most significant bit is shifted out and 
passed to the least significant bit position. Depending on the configuration selected, 
this shift may be within the same byte or from the most significant byte to the least 
significant byte. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AOI 
Immed 
A3-AO 


Mask 


Yes 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-COl 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Inactive 


SI01 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
No 
Holds MSB of MQ register. 


ZERO 
1 if result = 0 


N 
1 if MSB = 1 


OVR 
1 if signed arithmetic over~low 


C 
1 if carry-out 
= 1 


The following code converts an N-digit binary number to a N/4 digit excess-3 number 
in 2N + 3 clocks. It employs the standard conversion formula for a binary number: 


an 2n + an-1 2n-1 + an-2 2n-2 + ... 
+ ao 


([(2an + an-1) x 2 + an-1J x 2 + ... 
+ ao) x 2 + aO• 


The conversion begins with the most significant 
bit. Addition during the BINEX3 


instruction is performed in radix 10 (excess-3). 


LOADMO NUM 


SUB ACC, ACC, ACC 


SET1 ACC, 33 (Hex) 
Store 33 (Hex) in all bytes of 
accumulator. 


Repeat N times: 


(N = number of bits in binary number) 


Double accumulator and add in most 
significant bit of MO register. Circular left 
shift MO. 


EX3C ACC 


(END REPEAT) 


Byte OR Rand 
S 
(Byte Inclusive OR Rand 
S) 


Bytes with 
510 inputs programmed low evaluate R OR 5. Bytes with 
510 inputs 
programmed high, pass 5 unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 


.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


5100 
Yes 
Byte 
select 


5101 
Yes 
Byte 
select 


5102 
Yes 
Byte 
select 


5103 
Yes 
Byte 
select 


Cn 
No 
Inactive 


Byte OR Rand 
S 
(Byte Inclusive OR Rand 
S) 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) 
o 
o 
o 


Logically ORbytes 1 and 2 of register 12 with bytes 1 and 2 on the DB bus. Concatenate 
with DB bytes 0 and 3, storing the result in register 12. 


Instr 
Oprd 
Oprd 
Oprd 
Sel 
Dest 
Destination 
Selects 
- 
- 
Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF 1- 
OEY3- 
CF2- 
5103- 
IESt03- 
- 
C5-CO 
SELMQ 
W~O' 
- - 
- 
CFO 
5100 
tEStOO 
t7-10 
A5-AO 
B5-BO 
EA EBO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 


1111 
1000 
001100 
XX 
XXX X 
0 
10 
001100 
0 
0000 
10 
X 
X 
XXX X 
0 
X 
110 
1001 
0000 


tF 
= ALU 
result 
n = nth 
package 
Register 
file 
12 gets 
F if byte 
selected. 
S if byte 
not selected. 


Bytes with SIO inputs programmed low compute R' + S + en. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BOl 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-CO) (B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte select 


SIOl 
Yes 
Byte select 


SI02 
Yes 
Byte select 


SI03 
Yes 
Byte select 


Cn 
Yes 
Propagates through 
nonselected bytes; should be 


set high for two's 
complement 
subtraction. 


Byte Subtract R from S with Carry 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) = 0 
o 


1 if signed arithmetic overflow (selected bytes) 


1 if carry-out (most significant selected byte) 


Subtract bytes 1 and 2 of register 1 with carry from bytes 1 and 2 of register 3. 
Concatenate with bytes 0 and 3 of register 3, storing the result in register 11. 


Instr 
Op,d 
Op,d 
Op,d 
5el 
Des! 
Destination 
Selects 


Code 
Addr 
Add' 
EB1· 
Add, 
WE3· 
SELRF1· 
OEY3· 
CF2· 
5103· 
IESI03· 


17·10 
A5·AO 
B5·BO 
EA EBO 
C5·CO 
SELMQ 
WEO 
5ELRFO 
OEA 
DEB 
OEYO 
OES 
Cn 
CFO 
SiCo 
iESiOo 


10101000 
DO 0001 
000011 
0 
00 
001011 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 
1001 
0000 


t F = AlU 
result 
n = nth package 
Register file 11 gets F if byte selected. S if byte not selected. 


Byte Subtract S from R with Carry 


DESCRIPTION 


Bytes with SIO inputs programmed low compute R + S' + en. Bytes with SIO inputs 
programmed high, pass S unaltered. Multiple bytes can be selected only if they are 
adjacent to one another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-CO) (B5-BO) 


Yes 
No 
Yes 


Signal 
User 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte select 


SIOl 
Yes 
Byte select 


SI02 
Yes 
Byte select 


SI03 
Yes 
Byte select 


Cn 
Yes 
Propagates through nonselected bytes; should be 


set high for two's 
complement 
subtraction. 


Byte Subtract S from R with Carry 


ZERO 


N 


OVR 


C 


1 if result (selected bytes) = 0 
o 


1 if signed arithmetic overflow (selected bytes) 


1 if carry-out (most significant selected byte) 


Subtract bytes 1 and 2 of register 3 with carry from bytes 1 and 2 of register 1. 
Concatenate with bytes 0 and 3 of register 3. storing the result in register 11. 


Instr 
Op,d 
Op'd 
Op,d 
Sel 
Dest 
Destination 
Selects 
- 
OEY3- 
5103- 
IESI03- 
Code 
Add, 
Add' 
EB1- 
Add' 
WE3- 
SELRF 1- 
CF2- 


BS-BO - 
CS-CO 
SELMQ 
- 
OEYO - 
5100 
IESIOO 
17-10 
AS-AO 
EA EBO 
WEO 
SELRFO 
OEA 
OEB 
OES 
Cn 
CFO 


1001 
1000 
000001 
000011 
0 
00 
001011 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 
1001 
0000 


tF 
= ALU result 
n = nth byte 
Register 
file 
11 gets 
F if byte 
selected, 
S if byte 
not selected_ 


Byte XOR Rand 
S 
(Byte Exclusive OR Rand 
S) 


Bytes with SIO inputs programmed low evaluate R exclusive OR S. Bytes with SIO 
inputs programmed high. pass S unaltered. Multiple bytes can be selected only if they 
are adjacent to one another. At least one byte must be nonselected. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AQ) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
Y-Port 
(C5-COl 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


5100 
Yes 
Byte select 


5101 
Yes 
Byte select 


5102 
Yes 
Byte select 


5103 
Yes 
Byte select 


Cn 
No 
Inactive 


Byte XOR Rand 
S 
(Byte Exclusive OR Rand 
S) 


ZERO 


N 


OVR 


C 


1 if result 
(selected 
bytes) 
o 
o 
o 


Exclusive ORbytes 1 and 2 of register 6 with bytes 1 and 2 on the DB bus; concatenate 
the result with DB bytes 0 and 3, storing the result in register 10. 


Instr 
Op,d 
Op,d 
Op,d 
Sel 
Dest 
Destination 
Selects 


Code 
- 
S103- 
IESI03- 
Add, 
Add, 
EB1- 
Addr 
WE3- 
SELRF'- 
OEY3- 
CF2- 


17-10 
AS-AO 
BS-BO 
EA EBO 
CS-CO 
SELMQ 
WEO 
SELRFO - - 
- 
SIOO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 
IESIOO 


1101 
1000 
000110 
XX XXXX 
0 
10 
001010 
a 
0000 
10 
X 
X 
XXXX 
a 
1 
110 
1001 
0000 


tF 
= ALU 
result 
n = nth 
package 
Register 
file 
10 gets 
F if byte 
selected, 
S if byte 
not 
selected. 


ITTIJt 


tThis instruction may also be coded with the following opcodes: 
(2) IF). (3) IF). 141IFI. 16JIF). IB) IF). Ie] IF). IE] IF] 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


IA5-AOl 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 


DB-Port 
(B5-BOI 
Register 


No 
No 
No 


RF 
RF 
Y-Port 
(C5-COI 
(B5-BOI 


Yes 
No 
Yes 


I 


ZER~ 
= ~ 


OVR 
= 0 


Cn 
= 
0 


Data on the R bus is exclusive ORed with data on the S bus. If MOO XNORed with 
SO is zero (MOO is the LSB of the MO register and SO is the LSB of S-bus data), the 
result is sent to the ALU shifter. Otherwise, data on the S bus is sent to the ALU shifter. 


A right shift is performed; the MSB is filled with RO(MOO XOR SO), where ROis the 
LSB of R-bus data. A circular right shift is performed on MO data. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AOl 
Immed 
A3-AO 


Mask 


Yes 
No 
No 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
No 


User 
Signal 
Use 


Programmable 


SSF 
No 
Inactive 


5100 
No 
Inactive 


5101 
No 
Inactive 


5102 
No 
Inactive 


5103 
No 
Inactive 


Cn 
No 
Inactive 


Status Signals 


I 


ZERO 
= 
1 if result = 0 


N 
= 0 


OVR 
= 0 


Cn 
= 0 


CYCLIC REDUNDANCY 
CHARACTER CHECK 


DESCRIPTION 


Serialbinary data transmitted over a channel is susceptible to error bursts. These bursts 
may be detected and corrected by standard encoding methods such as cyclic 
redundancy check codes, fire codes, or computer generated codes. These codes all 
divide the message vector by a generator polynomial to produce a remainder that 
contains parity information about the message vector. 


If a message vector of m bits, a(x). is divided by a generator polynomial, g(x). of order 
k-1, a k bit remainder, r(x). is formed. The code vector, c(x). consisting of mIx) and 
r(x) of length n = m + k is transmitted down the channel. The receiver divides the 
received vector by g(x). 


After m divide iterations, r(x) will be regeneratedonly if there is no error in the message 
bits. After k more iterations, the result will be zero if and only if no error has occurred 
in either the message or the remainder. 


An algorithm for a cyclic redundancy character check, using the' ACT8832A as a 
receiver, is given below: 
LOADMQ VEC(X) 
Load MQ with first 32 message bits of 
received vector c'(x). 


LOAD POLY 


CLEAR SUM 


REPEAT(n/32) TIMES: 


SUM = SUM CRC POLY 


Load register with polynomial g(x). 


Clear register acting as accumulator. 


Perform CRC instruction where 


R Bus = POLY 
S Bus = SUM 


Store result in SUM. 


Load MQ with next 32 message bits of 
received vector c' (x). 


SUM now contains the remainder [r'(x)] of c'(x). A syndrome generation routine may 
be called next, if required. 


Note that the most significant bit of 


g(x) = (gk-1)(xk-1) + (9k_2)(xk-2) + . ·(go)(xO) 


is implied and that POLY(O) is set to zero if the length of g(x) requires fewer bits than 
are in the machine word width. 


DIVRF tests the result of the final step in nonrestoring division iteration: SDIVIT (for 
signed division) or UDIVIT (for unsigned division). An error in the remainder results 
when it is nonzero and the signs of the remainder and the dividend are different. 


The R bus must be loaded with the divisor and the S bus with the most significant 
half of the previous result. The least significant half is in the MO register. The Y bus 
result must be stored in the register file for use during the subsequent SDIVOF 
instruction. 


DIVRF tests to determine whether a fix is required and evaluates: 
Y +- S + R' + 1 if a fix is necessary 
Y +- S + R + 0 if a fix is unnecessary 


Overflow is reported to OVR at the end of the division routine (after SDIVOF). 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BOI 
Register 


Yes 
Yes 
No 


RF 
RF 
Y-Port 
(C5-COI 
(B5-BO) 


Yes 
No 
No 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
No 
Inactive 


SIOl 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
Yes 
Should be programmed high 


ZERO 
1 if remainder = 0 


N 
0 


OVR 
0 


Cn 
1 if carry-out 


Double·Length Normalize 


Tests the two most significant bits of a double precision number. If they are the same, 
shifts the number to the left. 


This instruction is used to normalize a two's complement. double precision number 
by shifting the number one bit to the left and filling a zero into the LSB unless 5100 
is low. The 5 bus holds the most significant half; the MQ register holds the least 
significant half. 


Normalization is complete when overflow occurs. The shift is inhibited whenever 
normalization is attempted on a number already normalized. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


Recommended S Bus Source 
Operands (MSH) 


RF 
MQ 
DB-Port 
(85-BO) 
Register 


Yes 
No 
No 


Recommended Destination 
Operands 
Shift Operations 
(conditional) 


RF 
RF 
Y-Port 
(C5-COI 
(85-BO) 


Yes 
No 
No 


Double-Length Normalize 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
When low, selects a one end-fill bit in LSB 


SIOl 
No 
Passes internally generated end-fill bits 


SI02 
No 


SI03 
No 


Cn 
No 


ZERO 


N 


OVR 


Cn 


if result 
= 0 


if MSB = 1 


1 if MSB XOR 2nd MSB 
o 


Normalize 
a double-precision 
number. 


(This example 
assumes that the MSH of the number to be normalized 
is in register 
3 


and the LSH is in the MQ register. The zero on the OVR pin at the end of the instruction 
cycle 
indicates 
that 
normalization 
is not 
complete 
and the 
instruction 
should 
be 


repeated). 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination Selects 


Code 
Addr 
Add, 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


0011 0000 
XX 
XXX 
X 
000011 
X 
00 
000011 
0 
0000 
10 
X 
X 
XXXX 
0 
X 
110 


Source 
1111 10100111 
0101 
1101 100001001110 
I 
ALU shifter 
+- RF(3) 


Source 
0011 0111 
1111 0110 
1101 100001000011 
I 
MQ shifter 
+- MQ register 


Destination 
11110100111010111011000010011101 
I 
8RF(3) +- Result (MSH) 


Destination 
0110 
1111 1110 1101 1011 000010000110 
I 
MQ register 
+- Result (LSH) 


G 
OVR 
+- ot 


The contents of the divide/BCD flip-flops are passed through the MQ register to the 
Y output Imultiplexer. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 


(A5-AOI 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 


DB-Port 
(B5-BO) 
Register 


No 
No 
No 


RF 
RF 
V-Port 
(C5-CO) 
(B5-BO) 


No 
No 
Ves 


I 


ZER~ 
~ 


OVR 
= 0 


Cn 
= 0 


Inst' 
Op,d 
Op,d 
Op,d Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMO 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


0101 
1111 
XX 
XXX 
X 
XX 
XXXX 
X 
XX 
XX 
XXXX 
1 
XXXX 
XX 
X 
X 
0000 
X 
X 
110 


Source 
0010101000000101 
0101 
01000111 
0000 I 
MQ register +- Divide/BCD flip-flops 


Destination 
00101010000001010101 
010001110000 
I 
Y output 
+- MQ register 


This instruction corrects excess-3 additions or subtractions in the byte mode. For 
correct excess-3 arithmetic, this instruction must follow each add or subtract. The 
operand must be on the 5 bus. 


Data on the 5 bus is added to a constant on the R bus determined by the state of 
the BCDflip flops and previous overflow condition reported on the 55F pin. Bytes with 
510 inputs programmed low evaluate the correct excess-3 representation. Bytes with 
510 inputs programmed high or floating, pass 5 unaltered. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
No 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
No 


Signal 
User 
Use 
Programmable 


SSF 
No 
Inactive 


SIOO 
Yes 
Byte select 


SI01 
Yes 
Byte select 


SI02 
Yes 
Byte select 


SI03 
Yes 
Byte select 


Cn 
No 
Inactive 


ZERO 


N 


OVR 
en 


o 
o 


1 if arithmetic signed overflow 


1 if carry-out = 1 


Add two BCD numbers and store the sum in register 3. Assume data comes in on 
DB bus. 


1. Clear accumulator (SUB ACC, ACC) 
2. Store 33 (Hex) in all bytes of register (SET1 R2, H/33f) 
3. Add 33 (Hex) to selected bytes of first BCD number (BADD DB, R2, R1) 
4. Add 33 (Hex) to selected bytes of second BCD number (BADD DB, R2, R3) 
5. Add selected bytes of registers 1 and 3 (BADD, R1, R3, R3) 
6. Correct the result (EX3BC, R3, R3) 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
E81- 
Addr 
WE3- 
SELRF1- 
0EY3- 
CF2- 
SiD3- 
iESi03- 


WEO 
SELRFO OEA - 
OEYO 
DES 
SiDo 
iESiOo 
17-10 
A5-AO 
85-80 
EA E80 
C5-CO 
SELMQ 
OE8 
Cn CFO 


11110010 
000010 
XX XXXX 
0 
XX 
000010 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 
XXXX 
XXXX 


00001000 
000010 
XX XXXX 
0 
XX 
000010 
0 
0000 
10 
X 
X 
XXXX 
0 
X 
110 
XXXX 
XXX X 


10001000 
000010 
XX XXXX 
0 
10 
00 0001 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1100 
0000 


1000 
1000 
000010 
XX XXXX 
0 
10 
000011 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1100 
0000 


1000 
1000 
000001 
000011 
0 
00 
000011 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1100 
0000 


10001111 
XX XXXX 
000011 
X 
00 
000011 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 
1100 
0000 


Assume DB bus holds 51336912 
at third 
instruction 
and 34867162 
at fourth 


instruction. 


This instruction corrects excess-3 additions or subtractions in the word mode. For 
correct excess-3 arithmetic, this instruction must follow each add or subtract. The 
operand must be on the S bus. 


Data on the S bus is added to a constant on the R bus determined by the state of 
the BCD flip-flops and previous overflow condition reported by the SSF pin.• 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
No 
No 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Inactive 


5100 
No 
Inactive 


5101 
No 
Inactive 


5102 
No 
Inactive 


5103 
No 
Inactive 


Cn 
No 
Inactive 


ZERO 


N 


OVR 
en 


o 


1 if MSB = 1 


1 if arithmetic signed overflow 


1 if carry-out 
= 1 


Add two BCD numbers and store the sum in register 3. Assume data comes in on 
DA bus. 


1. Clear accumulator (SUB ACC, ACC) 
2. Store 33 (Hex) in all bytes of register (SET1 R2, H/331l 
3. Add 33 (Hex) to all bytes of first BCD number (ADD DB, 112,R1) 
4. Add 33 (Hex) to all bytes of second BCD number (ADD DB, R2, R3) 
5. Add the excess-3 data (ADD, R1, R3, R3) 
6. Correct the excess-3 result (EX3C, R3, R3) 
7. Subtract the excess-3 bias to go to BCD result. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


11110010 
00 0010 
XX 
XXXX 
a 
XX 
00 0010 
a 
0000 
10 
X 
X 
XXXX 
a 
1 
110 


0000 
1000 
00 0010 
XX 
XXXX 
a 
XX 
00 0010 
a 
0000 
10 
X 
X 
XXXX 
a 
X 
110 


11110001 
00 0010 
XX 
XXXX 
a 
10 
00 0001 
a 
0000 
10 
X 
X 
XXXX 
a 
a 
110 


11110001 
00 0010 
XX 
XXXX 
a 
10 
00 0011 
a 
0000 
10 
X 
X 
XXXX 
a 
a 
110 


11110001 
00 0001 
00 0011 
a 
00 
00 0011 
a 
0000 
10 
X 
X 
XXXX 
a 
a 
110 


1001 1111 
XX 
XXXX 
00 0011 
X 
00 
00 0011 
a 
0000 
10 
X 
X 
XXXX 
a 
a 
110 


11110010 
00 0010 
00 0011 
a 
00 
00 0011 
a 
0000 
10 
X 
X 
XXXX 
a 
a 
110 


Assume 
DB bus holds 51336912 
at third instruction 
and 34867162 
at fourth 
instruction. 


0000 0000 0000 0000 0000 0000 0000 0000 I 
R F (2) •....0 


7 
10000110001000000100000001110100 
I 
RF(3) •....RF(3)-RF(2) 


Increment Negative R using Carry (R' + Cn) 


Data on the R bus is inverted and added with carry. The result appears at the ALU 
and MQ shifters. 


"The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


No 
No 
No 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed 
in bits 17-14 of 


SIOO 
No 
instruction 
field. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Increments if programmed 
high. 


ZERO 


N 


OVR 


C 


= 1 if result = 0 


= 
1 if MSB 
= 1 


1 if signed arithmetic overflow 


= 1 if carry-out = 1 


tc is ALU carry out and is evaluated before shift operation. ZEROand N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


lnstr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 
OES 
Cn 
CFO 


11110111 
XX 
XXXX 
XX 
XXXX 
1 
XX 
000100 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Data on the S bus is inverted and added to the carry. The result appears at the ALU 
and MQ shifters. 


"The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 
upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 
DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed 
in bits 17-14 of 


SIOO 
No 
instruction 
field. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Increments if programmed 
high. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB = 1 


1 if signed arithmetic overflow 


1 if carry-out = 1 


tc is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Convert the data on the MQ register to one's complement and store the result in 
register 4. 


Inst, 
Op,d 
Op,d 
Op,d Sel 
Dest 
Destination Selects 


Code 
Add, 
Add, 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMO 
WEO 
SELRFO 
OEA 
DEB 
OEYO 
DES 
Cn 
CFO 


11110101 
XX 
XXXX 
XX 
XXXX 
X 
11 
000100 
0 
0000 
10 
X 
X 
XXXX 
0 
0 
110 


"The result of this instruction can be shifted in the same microcycle by specifying a shift instruction in the 


upper nibble (17-14)of the instruction field. The result may also be passed without shift. Possible instructions 
are listed in Table 15. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


Yes 
No 
Yes 
No 


Available S Bus Source 
Operands 
(MSH) 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


No 
No 
No 


RF 
RF 
ALU 
MQ 
Y-Port 


(C5-CO) 
(B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift 
instructions 
programmed 
in bits 17-14 of 


SIOO 
No 
instruction 
field. 


SI01 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Increments 
R if programmed 
high. 


Increment 
R using Carry 
(R + Cn) 


ZERO 


N 


OVR 


Cn 


if result = 0 


if MSB = 1 


if signed arithmetic overflow 


if carry-out 
= 1 


t C is ALU carry-out 
and is evaluated 
before 
shift 
operation. 
ZERO and N (negative) 
are evaluated 
after 
shift 
operation. 
OVR (overflowl 
is evaluated 
after 
ALU 
operation 
and after 
shift 
operation. 


Instr 
Dprd 
Dprd 
Dprd Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
DEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO DEA 
DEB 
DEYO 
DES 
Cn 
CFO 


11110110 
XX 
XXX 
X 
XX 
XXXX 
1 
XX 
000100 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 


Destination I 0001011110010001111111101)110111 
I 
RF(4) +- R + Cn 


Increment 
5 using Carry (5 + Cn) 


"The 
result 
of this 
instruction 
can be shifted 
in the same microcycle 
by specifying 
a shift 
instruction 
in the 
upper nibble (17-14) of the instruction 
field. 
The result may also be passed without 
shift. 
Possible instructions 
are listed 
in Table 
1 5. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


Yes 
No 
Yes 
Yes 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Affect 
shift instructions 
programmed in bits 17-14of 


SIOO 
No 
instruction 
field. 


SIOl 
No 


SI02 
No 


SI03 
No 


Cn 
Yes 
Increments S if programmed 
high. 


Increment 
S using Carry 
(S + Cn) 


ZERO 


N 


OVR 


C 


if result = 0 


if MSB = 1 


if signed arithmetic overflow 


if carry-out = 1 


t C is AlU 
carry-out 
and is evaluated 
before 
shift 
operation. 
ZERO and N (negative) 
are evaluated 
after 
shift 
operation. 
OVR (overflow) 
is evaluated 
after 
AlU 
operation 
and after 
shift 
operation. 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination Selects 


Code 
Add, 
Addr 
E61- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
65-60 
EA E60 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OE6 
OEYO 
OES 
Cn 
CFO 


11110100 
XX 
XXXX 
XX 
XXXX 
X 
11 
00 0100 
a 
0000 
10 
X 
X 
XXXX 
a 
1 
110 


Destination 
0101 0100 
1111 1111 0000 0001 0000 0000 
I 
RF(4) +- S + Cn 


Uses an internal bypass path to load data from the S MUX directly into the divide/BCD 
flip-flops. 


C3-CO 


RF 
A3-AO 
.. 


DA-Port 
.. 


(A5-AO) 
Immed 
A3-AO 


Mask 


No 
No 
No 
No 


RF 
MQ 
DB-Port 
(B5-BO) 
Register 


Yes 
Yes 
Yes 


RF 
RF 
ALU 
MQ 
Y-Port 
(C5-CO) (B5-BO) 
Shifter 
Shifter 


No 
No 
No 
No 
No 


User 
Signal 
Programmable 
Use 


SSF 
No 
Inactive 


SIOO 
No 
Inactive 


SIOl 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
No 
Inactive 


Load Divide/BCD 
Flip-Flops 


IZERO 
= 0 
N 
= 0 


OVR 
0 


C 
0 


Inst, 
Op,d 
Op,d 
Op,d Sel 
Oest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1- 
Add, 
WE3- 
SELRF1- 
0EY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO OEA 
OEB 
OEYO DEs 
Cn 
CFO 


00001111 
XX 
XXXX 
XX 
XXXX 
X 
10 
XX 
XXXX 
X 
XXXX 
XX 
X 
X 
XXXX 
X 
X 
110 


Source 
0010 
1010 0000 
1000 
110001100001 
1000 Is .....DB bus 


Destination 
0010101000001000110001100001 
1000 I 
Divide/BCD flip-flops 
•....S 


Pass (Y - 
F) and Load MQ with F 


Passesthe result of the ALU instruction specified in the lower nibble of the instruction 
field to Y and the MQ register. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10)is passed unshifted to Y and the MQ register. 


ALU Shifter 
MO Shifter 


None 
None 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
No 
Holds MOO (LSB) 


SIOO 
No 
Inactive 


SI01 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
No 
Inactive 


ZERO 


N 


OVR 


C 


1 if result 
= 0 


1 if MSB of result 
= 1 
o if MSB of result 
= 0 


if signed arithmetic 
overflow 


if carry-out 
= 1 


t C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 
after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


Load the MQ register with data from register 1, and pass the data to the Y port. 


(In this example, data is passedto the ALU by and INCRinstruction without carry-in.) 


Instr 
Oprd 
Oprd 
Oprd Sel 
Dest 
Destination 
Selects 


Code 
Addr 
Addr 
EB1- 
Addr 
WE3- 
SELRF1- 
OEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
OEB 
OEYO 0fS 
Cn 
CFO 


11110110 
000001 
XX 
XXXX 
0 
XX 
XX 
XXXX 
0 
XXXX 
XX 
X 
X 
XXXX 
0 
0 
110 


Source 
00101010000010001100011000011000 
I 
R +- RF(1) 


Destination 
001010100000 
1000 
110001100001 
1000 I 
MQ register 
+- R + en 


Passesthe result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a circular left shift on MQ. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10)is passed unshifted to Y MUX. 


The contents of the MQ register are rotated one bit to the left. The MSB is rotated 
out and passed to the LSB of the same word, which may be 1, 2, or 4 bytes long. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MQ register. If SSF is low, the MQ register will not be altered. 


ALU Shifter 
MQ Shifter 


None 
Circular Left 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high or floating; 
retains MQ 


without 
shift if low. 


SIOO 
No 
Inactive 


SI01 
No 
Inactive 


SI02 
No 
Inactive 


SI03 
No 
Inactive 


Cn 
No 
Affects 
arithmetic operation programmed in bits 


13-10of instruction field. 


ZERO 


N 


OVR 


C 


1 if result = 0 


1 if MSB of result = 1 
o if MSB of result = 0 


1 if signed arithmetic overflow 


1 if carry-out = 1 


tc is ALU carry-out 
and is evaluated 
before 
shift 
operation. 
ZERO and N (negative) 
are evaluated 
after 
shift 
operation. 
OVR (overflow) 
is evaluated 
after 
ALU operation 
and after 
shift 
operation. 


Add data in register 1 to data on the DB bus with carry-in and store the unshifted 
result in register 1, Circular shift the contents of the MQ register one bit to the left. 


Instr 
Dprd 
Op,d 
Dprd Sel 
Dest 
Destination 
Selects 


Code 
Add, 
Add, 
EB1· 
Add, 
WE3- 
SELRF1- 
DEY3- 
CF2- 


17-10 
A5-AO 
B5-BO 
EA EBO 
C5·CO 
SELMQ 
WEO 
SELRFO 
DEA 
DEB 
OEYO 
OES 
Cn 
CFO 


11010001 
00 0001 
XX xxxx 
a 
10 
00 0001 
a 
0000 
10 
X 
X 
xxxx 
a 
1 
110 


Assume register file 1 holds 2508C618 
(Hex). DB bus holds 11007530 
(Hex). and 
MQ register holds 4DA99AOE (Hex), 


Pass (Y - 
F) with Logical Left MQ Shift 


Passesthe result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs a left shift on MO. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10)is passed unshifted to Y MUX. 


The contents of the MO register are shifted one bit to the left. A zero is filled into 
the least significant bit of each word unless the SID input for that word is programmed 
low; this will force the least significant bit to one. The MSB is dropped from each word, 
which may be 1, 2, or 4 bytes long, depending on the configuration selected. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MO register. If SSF is low, the MO register will not be altered. 


ALU Shifter 
MQ Shifter 


None 
Logical Left 


RF 
RF 
Y-Port 
(C5-CO) (B5-BOI 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high or floating; retains MQ 


without shift if low. 


SIOO 
Yes 
Fills a zero in LSB of MQ shifter if high or floating; 


sets LSB to one if low. 


SI01 
No 
Inactive in 32-bit configuration; used in 


SI02 
No 
configurations to select end-fill in LSBs. 


SI03 
No 


Cn 
No 
Affects arithmetic operation programmed in bits 


13-10of instruction field. 


ZERO 


N 


1 if result 
= 0 


1 if MSB of result 
o if MSB of result 


OVR 
if signed arithmetic 
overflow 


C 
if carry-out 
= 1 


t C is ALU carry-out and is evaluated before shift operation. ZERO and N (negative) are evaluated 


after shift operation. OVR (overflow) is evaluated after ALU operation and after shift operation. 


EXAMPLE 
(assumes a 32-bit 
configuration) 


Add data in register 7 to data on the DB bus with carry-in and store the unshifted 
result in register 7. Shift the contents of the MQ register one bit to the left, filling 
a zero into the least significant bit. 


Instr 
Op,d 
Op,d 
Oprd 
Sel 
Dest 
Destination 
Selects 


WE3· 
SELRF1· 
- 
Code 
Addr 
Add, 
EB1· 
Add, 
OEY3- 
CF2- 
S103- 
IESI03- 
, 
- 
- 
17-10 
A5-AO 
B5-BO 
EA EBO 
C5-CO 
SELMQ 
WEO 
SELRFO 
OEA 
DEB 
OEYO 
DES 
Cn 
CFO 
SIOO 
IESIOO 


11000001 
000111 
XX 
XXXX 
0 
10 
000111 
0 
0000 
10 
X 
X 
XXXX 
0 
1 
110 
1111 
0000 


Assume register file 7 holds 7308C618 (Hex), DB bus holds 54007530 
(Hex), and 
MQ register holds 61A99AOE (Hex). 


Pass (Y - 
F) with Arithmetic Right MQ Shift 


Passesthe result of the ALU instruction specified in the upper nibble of the instruction 
field to Y MUX. Performs an arithmetic right shift on MO. 


The result of the arithmetic or logical operation specified in the lower nibble of the 
instruction field (13-10)is passed unshifted to Y MUX. 


The contents of the MO register are rotated one bit to the right. The sign bit of the 
most significant byte is retained. Bit 0 of the least significant byte is dropped. 


The shift may be made conditional on SSF. If SSF is high or floating, the shift result 
will be sent to the MO register. If SSF is low, the MO register will not be altered. 


ALU Shifter 
MQ Shifter 


None 
Arithmetic 
Right 


RF 
RF 
Y-Port 
(C5-CO) 
(B5-BO) 


Yes 
No 
Yes 


User 
Signal 
Use 
Programmable 


SSF 
Yes 
Passes shift result if high or floating; retains MQ 


without 
shift if low. 


SIOO 
No 
Outputs LSB of MQ shifter (inverted). 


SI01 
No 
Inactive in 32-bit configurations; 
used in other 


SI02 
No 
configurations to output LSBs from MQ shifter 


SI03 
No 
(inverted). 


Cn 
No 
Affects arithmetic operation programmed in bits 


13-10of instruction field. 


SN74ACT8818A 
16-Bit Microsequencer 


I 
SN74ACT8832A 
32-Bit Registered ALU 


SN74ACT8836A 
32- x 32-Bit Parallel Multiplier 
II 


• 
Performs Full 32-Bit by 32-Bit 
Multiply/Accumulate 
in Flowthrough 
Mode 
in 48 ns (MinI 


• 
Can be Pipelined for 30 ns (MinI Operation 


• 
Performs 54-Bit by 54-Bit Multiplication 
in 
Five Cycles 


• 
Supports Division Using Newton-Raphson 
Approximation 


• 
Signed. Unsigned. or Mixed-Mode 
Multiply 
Operations 


• 
Low-Power 
'-/lm 
EPIC'" CMOS 


SN74ACT8836A 
32·BIT 
BY 32-BIT 
MULTIPLIER/ACCUMULATOR 


• 
Multiplier. 
Multiplicand. 
and Product Can Be 
Complemented 


• 
Accumulator 
Bypass Option 


• 
TTL I/O Voltage Compatibility 


• 
Three Independent 32-Bit Buses for 
Multiplicand. 
Multiplier. 
and Product 


• 
Parity Generation/Checking 


• 
Master/Slave 
Fault Detection 


• 
Single 5-V Power Supply 


• 
Optional Integer or Fractional Rounding 


description 


The' ACT8836A 
is a 32-bit by 32-bit parallel multiplier/accumulator 
(MAC) suitable for low-power. 
high- 
speed operations 
in applications 
such as digital signal processing, 
array processing. 
and numeric data 
processing. 
High speed is achieved through the use of a Booth and Wallace Tree architecture. 


Data is input to the chip through two registered 32-bit DA and DB input ports and output through a registered 
32-bit Y output port. These registers have independent clock enable signals and can be made transparent 
for flowthrough 
operations. 


The device can perform two's 
complement. 
unsigned. and mixed-data arithmetic. 
It can also operate as 
a 64-bit 
by 64-bit multiplier. 
Five clock cycles are required to perform a 64-bit by 64-bit 
multiplication 
and also multiplex the 128-bit 
result. Division is supported using Newton-Raphson 
approximation. 


A multiply/accumulate 
mode is provided to add or subtract 
the accumulator 
from the product 
or the 
complement of the product. The accumulator is 67 bits wide to accommodate possible overflow. A warning 
flag (ETPERR)indicates whether 
overflow 
has occurred. 


A rounding feature in the' ACT8836A 
allows the result to be truncated or rounded to the nearest 32-bits. 


To ensure data integrity. 
byte parity checking is provided at the input ports. and a parity generator and 
master/slave 
error detection 
comparator 
are provided at the output port. 


The' ACT8836A 
is characterized 
for operation from O°C to 70°C. 


Two 32-bit input data ports. DA and DB. are provided for input of the multiplicand and multiplier to registers 
A and B and the multiplier/adder. 
Input data can be clocked to the A and B registers before being passed 
to the multiplier/adder 
if desired. Two multiplexers. 
Rand S. in conjunction 
with a flowthrough 
decoder 
select the multiplier operands from DA and DB inputs. A and B registers. or the temporary register. Data 
is supplied to the temporary 
register from a shifter that operates on external DA/DB data or a previous 


multiplier/adder result. The 67-bit multiplier/adder result can be output through the Y port or passed through 
the shifter to the accumulator. 


External DA and DB data is also available to the accumulator via the shifter. This 64-bit data can be extended 
with zeros or the sign bit. The 64 least significant 
bits from the shifter may also be latched in the 64-bit 
temporary register and input to the multiplier through the Rand S multiplexers. 
A swap option allows the 
most significant and least significant 32-bit halves of temporary register data to be swapped before being 
made available to the Rand S multiplexers. 
This allows either 32-bit half of the temporary 
register to be 
used as a multiplier. 


PROOUCTION OATA documents 
contain 
information 
current as of publication date. Products conform to 
specifications per the terms of Texas Instruments 
standard warranty. Production processing does nDt 
necessarily 
include testing of all parameters. 


. 
TEXAS ~ 
INsrRUMENTS 


SN74ACT8836A 
32·81T 8Y 32·81T MULTIPLIER/ACCUMULATOR 


pin descriptions 


Pin assignments 
and terminal functions 
for the' ACT8836A 
are given on the following 
pages. The pin at 
location 
M9 is omitted 
and a pin at M 10 is added for indexing purposes. 


15 x 15 GO PIN-GRID-ARRAY 
PACKAGE 


(TOP VIEW) 


2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 


A • • • • • • • • • • • • • • • 
• 
B • • • • • • • • • • • • • • • 


C • • • • • • • • • • • • • • • 


D • • • 
• • • 
• • • 


E • • • 
• • • 


F • • • 
• • • 


G • • • • 
• • • • 


H • • • • 
• • • • 


J • • • • 
• • • • 


K • • • 
• • • 


L • • • 
• • • 


M • • • 
• • 
• 
• • • 


N • • • • • • • • • • • • • • • 


p • • • • • • • • • • • • • • • 


R • • • • • • • • • • • • • • • 


TEXAS • 
INSTRUMENTS 


SN74ACT8836A 
32·BIT BY 32-BIT MULTIPLIER/ACCUMULATOR 


PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 


A1 
Y8 
B12 
YETP1 
D14 
PYO 
H12 
COMPl 
M3 
DB18 
P5 
DB25 
A2 
Y10 
B13 
YETPO 
D15 
ETPERR 
H13 
FTO 
M7 
PB1 
P6 
DB29 
A3 
Y11 
B14 
YETP2 
E1 
SElREG 
H14 
EA 
M8 
PAO 
P7 
DB31 
A4 
Y13 
B15 
PY3 
E2 
Y3 
H15 
CKEA 
M10 
DA6 
P8 
PERRA 
A5 
Y14 
C1 
YO 
E3 
GND 
J1 
DB2 
M13 
DA16 
P9 
PA2 
A6 
Y16 
C2 
Y4 
E13 
GND 
J2 
DB3 
M14 
DA17 
P10 
DA2 
A7 
Y18 
C3 
EB 
E14 
PY2 
J3 
DB5 
M15 
DA25 
P11 
DA8 
A8 
Y19 
C4 
Y5 
E15 
RND1 
J4 
DB7 
N1 
DB10 
P12 
DA12 


A9 
Y21 
C5 
VCC 
F1 
SFTO 
J12 
DA26 
N2 
DB19 
P13 
DA14 
A10 
Y23 
C6 
GND 
F2 
Yl 
J13 
DA24 
N3 
DB20 
P14 
DAll 
A11 
Y25 
C7 
Y15 
F3 
GND 
J14 
DA30 
N4 
DB21 
Pl'5 
DA21 
A12 
Y27 
C8 
GND 
F13 
GND 
J15 
DA31 
N5 
DB23 
Rl 
DB14 
A13 
Y28 
C9 
Y22 
F14 
MSERR 
Kl 
DB4 
N6 
DB27 
R2 
DB26 
A14 
Y30 
Cl0 
GND 
F15 
DASGN 
K2 
DB9 
N7 
VCC 
R3 
DB28 
A15 
PYl 
Cll 
VCC 
Gl 
SELD 
K3 
DBll 
N8 
GND 
R4 
DB30 
Bl 
Y2 
C12 
CKEY 
G2 
SGNEXT 
K13 
DA22 
N9 
DAO 
R5 
PBO 
B2 
Y6 
C13 
OEY 
G3 
WElS 
K14 
DA28 
Nl0 
DA4 
R6 
PB2 
B3 
SElY 
C14 
ACCO 
G4 
SFT1 
K15 
DA29 
Nll 
DA10 
R7 
PB3 
B4 
Y7 
C15 
PERRY 
G12 
RNDO 
L1 
DB6 
N12 
DA13 
R8 
PERRB 
B5 
Y9 
Dl 
WEMS 
G13 
DBSGN 
l2 
DB15 
N13 
DA15 
R9 
PA1 
B6 
Y12 
D2 
TPl 
G14 
CKEI 
l3 
DB13 
N14 
DA19 
Rl0 
PA3 
B7 
Y17 
D3 
TPO 
G15 
FT1 
L13 
DA18 
N15 
DA23 
Rll 
DAl 
B8 
Y20 
D7 
GND 
H1 
ClK 
l14 
DA20 
Pl 
DB12 
R12 
DA3 
B9 
Y26 
D8 
VCC 
H2 
CKEB 
L15 
DA27 
P2 
DB16 
R13 
DA5 
Bl0 
Y29 
D9 
Y24 
H3 
DBO 
M1 
DB8 
P3 
DB24 
R14 
DA7 
Bll 
Y31 
D13 
ACCl 
H4 
DBl 
M2 
DB17 
P4 
DB22 
R15 
DA9 
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DASGN 


DBSGN 


COMPL 


RNDO 


RND1 
ACCO 


ACC1 


SELREG 


WEMS 


WELS 


<I> 


32 • 32 MULTIPLIER/ 


ACCUMULATOR 
SN74ACT8836A 


(H11 
CLK 
DA PORT 
(B8) 


DB PORT 
1081 


(H1SI 
•.... 


DA REG 
PAR 
Y PORT 
(C1S) 


(H2) 
r--... 
STAT 
(E14) 
DB REG 
CLK 
MASTER/SLAVE 
(G141 
r--... 


I REG 
EN 
EQUAL CHK 
(D1S) 


(C121 
•..... 


Y REG 


0 
(014) 


(F1SI 
Y PORT 
~ 
(A1S) 


(G131 
PARITY 
~ 
r 
(E141 


(H12) 
I/O 
(B1S) 
3 
(G12) 
INSTR 
(E1S) 
INPUTS 
(M81 


(C141 
0 
(Q91 
(013) 
DA 
(P9) 
PORT 


PARITY 
3 
(Q101 


(F1 I 
INPUTS 
0 
IOS) 
o ISHIFTER 
(G4) 
1 
CONTROL 
(M7) 
DB 
(Q6) 
PORT 
(H13) 
(Q7) 
o IFEEDTHROUGH 
3 
(G1S) 
1 
CONTROL 


0 
(B131 


(B3) 
EXTENDED I 
(B12) 


(G1) 
YMUX 
PRECISION 
(B141 
DMUX 
INPUT 
2 
(H14) 
r--... 


RMUX 
SELECT 
(C3) 
•..... 


Y OUT/EN 
./I 
(C13) 
SMUX 


(E71 
(03) 


•..... 
RA or RB I 
TESTI 
0 
(02) 
(01) 
MS 32-BITS 
WRITE 
PINS 
1 
(G31 
•..... 
ENABLE 
LS 32-BITS 
., 
r 


0 
I DAT~ 
• 
• 
• 
• 
• 
• 


~ 


31 
0 
• 
• 
I RESULTS) 
• 
• 
0 
• 
• 
• 
• I DAT~ 


31 
• 
• 
• 
• 
31 


CKEA 


CKEB 


CKEI 


CKEY 


SFTO 


SFT1 


SELY 


SELD 
EA 


EB 


PERRA 


PERRB 


PERRY 


MSERR 


ETPERR 


YETPO 


YETP1 


YETP2 
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MULTIPLIER/ACCUMULATOR 


SGNEXT 


PA3-PAO 
4 
SELD 


PB3-PBO 
4 
2 
SFT1-SFTO 


PERRA 


PERRB 


2 
SELREG 


WEMS 


32 
WELS 


DA31-DAO 
32 
32 


CKEA 
DB31-DBO 


CKEB 


DASGN 


DBSGN 


RND1-RNDO 


COMPL 


ACC1-ACCO 


PIPELINE REGISTER 


MULTIPLIER/ADDER 
STAGE 2 


FT1-FTO 
• 


TP1-TPO • 


CLK ••••--- 
VCC. 
J4 
~9 
GND. 
I 


TEXAS ~ 
INSTRUMENlS 


SN74ACT8836A 
32·BIT 
BY 32·BIT 
MULTIPLIER/ACCUMULATOR 


PIN 
I/O 
DESCRIPTION 


NAME 
NO. 


ACCD 
C14 
Accumulate mode opcode (see Function Tables) 
I 
ACC1 
D13 


ClK 
H1 
I 
System clock 


CKEA 
H15 
I 
Clock enable for A register, active low 


CKEB 
H2 
I 
Clock enable for B register, active low 


CKEI 
G14 
I 
Clock enable for I register, active low 


CKEY 
C12 
I 
Clock enable for Y register, active low 


COMPl 
H12 
I 
Product complement control; high complements multiplier result, low passes multiplier unaltered 


to accumulator. 


DAO 
N9 


DA1 
R11 


DA2 
P10 


DA3 
R12 


DA4 
N10 


DA5 
R13 


DA6 
M10 


DA7 
R14 


DAB 
P11 


DA9 
R15 


DA10 
N11 


DA11 
P14 


DA12 
P12 


DA13 
N12 


DA14 
P13 


DA15 
N13 
I 
DA port input data bits 0 through 31 


DA16 
M13 


DA17 
M14 


DA18 
l13 


DA19 
N14 


DA20 
l14 


DA21 
P15 


DA22 
K13 


DA23 
N15 


DA24 
J13 


DA25 
M15 


DA26 
J12 


DA27 
l15 


DA28 
K14 


DA29 
K15 


DA30 
J14 


DA31 
J15 


DASGN 
F15 
I 
Sign magnitude control; high identifies DA input data as two's complement, low identifies DA input 


data as unsigned 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8836A 
32·BIT 
BY 32-BIT 
MULTIPLIER/ACCUMULATOR 


PIN 
DESCRIPTION 
110 
NAME 
NO. 


DBa 
H3 


DB1 
H4 


DB2 
J1 


DB3 
J2 


DB4 
K1 


DB5 
J3 


DB6 
L1 


DB7 
J4 


DBB 
M1 


DB9 
K2 


DB10 
N1 


DB11 
K3 


DB12 
P1 


DB13 
L3 
, 


DB14 
R1 


DB15 
L2 
I 
DB port input data bits a throu9h 31 
DB16 
P2 


DB17 
M2 


DB1B 
M3 


DB19 
N2 


DB20 
N3 


DB21 
N4 


DB22 
P4 


DB23 
N5 


DB24 
P3 


DB25 
P5 


DB26 
R2 


DB27 
N6 


DB28 
R3 


DB29 
P6 


DB30 
R4 


DB31 
P7 


DBSGN 
G13 
I 
Sign 
magnitude 
control; 
high 
identifies 
DB input 
data 
as two's 
complement, 
low 
identifies 
DB input 


data as unsigned. 


EA 
H14 
I 
Core multiplier operand select. A high on this signal selects DA register for input on the R bus; a 


low selects the swap MUX. 


EB 
C3 
I 
Core multiplier operand select. A high on this signal selects DB register for input on the S bus; a 


low selects the swap MUX. 


ETPERR 
015 
0 
Equality check result. A low on this signal indicates that bits 67 through 64 of the core multiplier 


results are equal to bit 63. 


FTO 
H13 
I 
Feedthrough control signals for A, B, I, Pipeline and Y registers (see Function Tables). 
FTt 
G15 


TEXAS ~ 
INSTRUMENlS 


SN74ACT8836A 
32·BIT 
BY 32-BIT 
MULTIPLIER/ACCUMULATOR 


PIN 
DESCRIPTlDN 
I/O 
NAME 
NO. 


GND 
C6 


GND 
C8 


GND 
Cl0 


GND 
D7 


GND 
E3 
Ground pins. All ground pins should be used and connected. 


GND 
E13 


GND 
F3 


GND 
F13 


GND 
N8 


MSERR 
F14 
0 
Master/slave error flag. This signal goes high when the contents of the Y output multiplexer and 


the value at the external port are not equal. 


OEY 
C13 
I 
Y, YETP2-YETPO,and PY3-PYOoutput enable, active low. 


PAO 
M8 


PAl 
R9 
I 
Parity input data bus for DA input data 
PA2 
P9 


PA3 
Rl0 


PBO 
R5 


PBl 
M7 
I 
Parity input data bus for DB input data 
PB2 
R6 


PB3 
R7 


PYO 
D14 


PYl 
A15 
Y output parity data bus. Outputs data from parity generator (OEY = LI or inputs external parity 
I/O 
data (OEY = H). 
PY2 
E14 


PY3 
B15 


PERRA 
P8 
0 
DA port parity status pin. Goes high if even-parity test on any byte fails. 


PERRB 
R8 
0 
DB port parity status pin. Goes high if even-parity test on any byte fails. 


PERRY 
C15 
0 
Y port parity status pin. Goes high if even-parity test on any byte fails. 


RNDO 
G12 
I 
Multiplier/accumulator 
rounding control; high rounds integer result; low leaves result unaltered. 


RNDl 
E15 
I 
Multiplier/accumulator 
rounding control; high rounds fractional result; low leaves result unaltered. 


SELD 
Gl 
I 
D multiplexer select. High selects DA and DB ports; low selects multiplier core output. 


SELREG 
El 
I 
Write enablefor temporary register and accumulator. High enablesthe temporary register; low enables 


the accumulator. 


SELY 
B3 
I 
Y multiplexer select. High selects most significant 32 bits of Y register output; low selects least 


significant 32 bits. 


SGNEXT 
G2 
I 
Sign extend control for multiplexer. A low fills shift matrix bits 66-64 with zeros; a high fills DA3l 


in bits 66-64. 


SFTO 
Fl 
I 


SFT1 
G4 
Shift multiplexer control (see Function Tables). 


TPO 
D3 
I 
Test pins (see Function Tables) 
TPl 
D2 


VCC 
C5 


VCC 
Cll 


VCC 
D8 
Supply voltage (5 VI 


VCC 
N7 


WEMS 
Dl 
I 
Write enable for most significant 32 bits of temporary register and accumulator active low. 


WELS 
G3 
I 
Write enable for least significant 32 bits of temporary register and accumulator active low. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8836A 
32-81T 
8Y 32-81T 
MULTIPLIER/ACCUMULATOR 


PIN 
DESCRIPTION 
I/O 
NAME 
NO. 


YO 
C1 


Y1 
F2 


Y2 
B1 


Y3 
E2 


Y4 
C2 


Y5 
C4 


Y6 
B2 


Y7 
B4 


Y8 
A1 


Y9 
B5 


Y10 
A2 


Y11 
A3 


Y12 
B6 


Y13 
A4 


Y14 
A5 


Y15 
C7 
Y port data bus. Outputs data from Y register 10EY = L); inputs data to master/slave comparator 
I/O 
10EY = HI. 
Y16 
A6 


Y17 
B7 


Y18 
A7 


Y19 
A8 


Y20 
B8 
, 


Y21 
A9 


Y22 
C9 


Y23 
A10 


Y24 
09 


Y25 
A11 


Y26 
B9 


Y27 
A12 


Y28 
A13 


Y29 
B10 


Y30 
A14 


Y31 
B11 


YETPO 
B13 


YETP1 
B12 
I/O 


Data bus for extended precision product. Outputs three most significant bits of the 67-bit multiplier 


YETP2 
B14 
core 
result; 
inputs 
external 
data 
to master/slave 
comparator. 
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ACC' 
ACCO 
EA 
EB 
Operation 


0 
0 
X 
X 
±(R x SI + 0 


0 
1 
X 
X 
±(R x SI + ACC 


1 
0 
X 
X 
±(R x SI - 
ACC 


1 
1 
0 
0 
±1 
x 
1 + 0 


1 
1 
0 
1 
± 1 x DB + 0 


1 
1 
1 
0 
±DA 
x 
1 + 0 


1 
1 
1 
1 
±DA 
x DB + 0 


SFT1 
SFTO 
Shifter Operation 


L 
L 
Pass data without shift 


L 
H 
Shift one bit left; fill with zero 


H 
L 
Swap upper and lower halves of temporary register 


H 
H 
Shift 32 bits right; fill with sign bit 


Control Inputs 
Registers Bypassed 


FT1 
FTO 
Pipeline 
Y 
I 
A 
B 


L 
L 
Yes 
Yes Yes Yes Yes 


L 
H 
Yes 
No 
No 
No 
No 


H 
L 
Yes 
Yes 
No 
No 
No 


H 
H 
No 
No 
No 
No 
No 


TP' 
TPO 
Operation 


L 
L 
All outputs and lias forced low 


L 
H 
All outputs and lias forced high 


H 
L 
All outputs placed in a high impedance state 


H 
H 
Normal operation (default state) 
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architectural 
elements 


Included in the functional 
block diagram of the' ACT8835A 
are the following 
blocks. 


1. Two 32-bit 
registered input data ports DA and DB 
2. A parity checker at the DA and DB inputs 
3. An instruction 
decoder (I register) 
4. 
A flowthrough 
decoder that permits selected registers to be bypassed to support up to three 


levels of pipelining 
5. 
Rand S multiplexers to select operands for the multiplier/ adder from DA and DB inputs, registers 
A and B, or temporary 
register 
5. 
A 0 multiplexer that selects the operand for the shifter from the 57-bit sign-extended 
DA and 
DB inputs or the multiplier/adder 
output 


7. A shifter block that operates on DA/DB input data or on multiplier/adder 
outputs for scaling or 
Newton-Raphson 
division 
8. 
A Y output multiplexer that selects the most significant 
half or the least significant 
half of the 
multiplier/ 
adder result for output 
at the registered Y port 
9. An extended precision error check that tests for overflow 


10. 
A master/slave comparator and parity generator/comparator 
at the Y output port for master/slave 
and parity checking 


11. 
Registers at the external data and instruction 
input ports and the shifter and multiplier/adder 


output 
port to support pipelining 


input data parity checker 


An even-parity 
check is performed on each byte of input data at the DA, DB and Y ports. If the parity 
test fails for any byte, a high appears at the parity error output pin (PERRA for DA data, PERRBfor DB 
data, PERRY for Y data). 


A and B registers 


Register A can be loaded with data from the DA bus, which normally holds a 32-bit multiplicand. 
Register 


B is loaded from the DB bus which holds a 32-bit multiplier. Separate clock enables, CKEA and CKEB, allow 
the registers to be loaded separately. This is useful when performing 
double precision multiplication 
or 
using the temporary 
register as an input to the multiplier/adder. 
The registers can be made transparent 


using the FT inputs (see Function Tables). 


instruction 
register 


Instruction 
inputs to the device are shown in Table 1. These signals control signed, unsigned, and mixed 


multiplication 
modes, fractional 
and integer rounding, 
accumulator 
operations, 
and complementing 
of 
products. 
They can be latched into instruction 
register I when clock enable CKEI is low. 


Signal 
High 
Low 


DASGN 
Identifies DA Input data as two's 
complement 
Identifies DA input data as unsigned 


DBSGN 
Identifies DB input data as two's 
complement 
Identifies DB input data as unsigned 


RNDO 
Rounds integer result 
Leaves integer result unaltered 
RNDl 
Rounds fractional result 
Leaves fractional result unaltered 


COMPL 
Complements the product from the multiplier 
Passes the product from the multiplier to the 
before passing it to the accumulator 
accumulator unaltered 


ACCO 


ACCl 
See Function Tables 
See Function Tables 
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Rounding inputs RNDOand RND1 control rounding operations in the multiplier/adder. 
A low on these inputs 
passes the results unaltered. 
If a high appears on RND1, the result will be rounded by adding a 1 to bit 
30. RND 1 should be set high if the multiplier/adder 
result is to be shifted 
in order to maintain 
precision 
of the least significant bit following the shift operation. If a high appears on RNDO, the result will be rounded 
by adding a one to bit 31. This code should be used when the adder result will not be shifted. 


A complement 
control, 
COMPL, is used to complement 
the product 
from the muliplier before passing it 
to the accumulator. 
The complement 
will occur if COMPL is high; the product 
will be passed unaltered 


if COMPL is low. 


ACC 1-ACCO control the operation of the multiplier/adder. 
Possible operations 
are shown in the function 
tables. 


PA3-PAO 


PB3·PBO 


PERRA 


PERRB 


DA31-DAO 


CKEI 


EA 


DASGN 


DBSGN 


RND1-RNDO 


COMPl 
2 
ACC1-ACCO 
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R, S, and swap multiplexers 


The Rand S multiplexers 
select the multiplier/adder 
operands from external data or from the temporary 
register. 


When EA is low, the R multiplexer selects data from the swap multiplexer. When EA is high, the R multiplexer 
selects data from DA or the A register, depending on the state of the f1owthrough control inputs (see Function 
Tables). When EB is low, the S multiplexer 
selects data from the swap multiplexer. 
When EB is high, 
the S multiplexer switches data from DB or the B register, depending on the state of the flowthrough 
control 


inputs. 


EA and EB are also used in conjunction 
with the multiplier/adder 
control 
inputs to force a numeric one 
on the R or S inputs (see Function Tables). 


The swap multiplexers 
are controlled 
by the shifter control inputs. When SFT1 is high and SFTO is low, 


the most significant 
half of the temporary register is available to the S multiplexer, 
and the least significant 
half is available to the R multiplexer. 
When SFT1-SFTO are set to other values, the most significant 
half 
of the temporary 
register is available to the R multiplexer, 
and the least significant 
half is available to the 
S multiplexer. 


multiplier/adder 


The multiplier performs 32-bit multiplication 
and generates a 67-bit product. The product can be latched 
in the pipeline to increase cycle speed. The product is complemented 
when COMPL is set high as shown 
in Table 1. The adder computes the sum or the difference of the accumulator 
and the product and gives 
a 67-bit 
sum. Bits 66-64 
are used for overflow 
and sign extension. 


D multiplexer 


The D multiplexer 
selects input data for the shifter. Two sources are available to the multiplexer: 
a 64-bit 
word formed by concatenating 
DA and DB bus data, and the 67-bit sum from the multiplier/adder. 
If SELD 


is high, external DA/DB data is selected; 
if SELD is low, the sum is selected. 


If the 64-bit word is selected for input to the shifter, three bits are added to the word based on the state 
of the sign extend signal (SGNEXT). If SGNEXT is low, bits 66-64 are zero-filled; 
if SGNEXT is high, bits 
66-64 
are filled with the value on DA31. 


temporary register and accumulator 


Output from the shifter will be stored in the temporary 
register if SELREGis high and in the accumulator 
register if SELREGis low. The 64-bit temporary 
register can be used to store temporary 
data, constants, 


and scaled binary fractions 
(see Figure 2). 


Separate clock controls, WELS and WEMS, allow the most significant 
and least significant 
halves of the 
shifter output to be loaded separately. The 32 least significant bits of the selected register are loaded when 
WELS is low; the most significant 
bits when WEMS is low. When WELS and WEMS are both low, the 
entire word from the shifter 
is loaded into the selected register. 


The shifter can be used to multiply 
by two for Newton-Raphson 
operations or perform a 32-bit shift for 
double precision multiplication. 
The shifter is controlled by two SFT inputs, as shown in the function tables. 


Y register 


Final or intermediate 
multiplier/adder 
results will be clocked into Y register when CKEY is low. 
Results 
can be passed directly to the Y output multiplexer using flowthrough 
decoder signals to bypass the register 


(see Function Tables). 
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SGNEXT 
SELD 


2 
SFT1·SFTO 


2 
SELREG 


WEMS 
WELS 
32 


32 


32 
32 
DA31·DAO 
0831·080 


CKEA 
CKE8 


MULTIPLIER/ADDER 
STAGE 1 


PIPELINE REGISTER 


MULTIPLIER/ADDER 
STAGE 2 


Y multiplexer and Y output multiplexer 


The Y multiplexer 
allows the 64-bit 
result or the contents 
of the Y register to be switched 
to the Y bus, 
depending upon the state of the flowthrough 
control outputs. 
The upper 32 bits are selected for output 
when the Y output multiplexer 
control SELY is high; the lower 32 bits are selected for output 
when SELY 
is low. Note that the Y output multiplexer 
can be switched 
at twice the clock rate so that the 64-bit result 
can be output 
in one clock cycle. 


flowthrough 
decoder 


To enable the device to operate in pipelined or flowthrough 
modes, on-chip registers can be bypassed using 
flowthrough 
control 
signals FT1 and FTO. Up to three levels of pipeline can be supported, 
as shown in 
the function 
tables. 


extended precision check 


Three extended 
product 
outputs, 
YETP2-YETPO, are provided to recover three bits of precision during 
overflow. 
An extended precision check error signal (ETPERR)goes high whenever overflow 
occurs. If sign 
controls 
DASGN and DBSGN are both low, indicating 
an unsigned operation, the extended precision bits 
66-64 
are compared 
for equality. 
Under all other sign control 
conditions, 
bits 66-63 
are compared 
for 
equality. 
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PIPELINE REGISTER. 


MULTIPLIER/ADDER 
STAGE 2 


Y31·YO 


FIGURE 3. 
OUTPUT 


extended precision check 


Three extended 
product 
outputs, 
YETP2-YETPO, are provided to recover three bits of precision 
during 
overflow. 
An extended precision check error signal (ETPERR)goes high whenever overflow 
occurs. If sign 
controls 
DASGN and DBSGN are both low, indicating 
an unsigned operation, 
the extended precision bits 
66-64 
are compared for equality. 
Under all other sign control 
conditions, 
bits 66-63 
are compared for 
equality. 
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master slave comparator 


A master/slave 
comparator 
is provided to compare data bytes from the Y output 
multiplexer 
with 
data 
bytes on the external Y port when OEY is high. A comparison of the three extended precision bits of the 
multiplier/adder 
result or Y register output 
with 
external 
data in the YETP1-YETPO port is performed 
simultaneously. 
If the data is not equal, a high signal is generated on the master slave error output 
pin 
(MSERR). A similar comparison 
is performed for parity using the PY3-PYO inputs. This feature is useful 
in fault-tolerant 
design where several devices vote to ensure hardware 
integrity. 


test pins 


Two pins, TP1-TPO, support system testing. 
These may be used, for example, to place all outputs 
in a 
high-impedance 
state, isolating the chip from the rest of the system (see Function Tables). 


The' 
ACT8836A 
performs 
single-precision 
and double-precision 
multiplication 
in two's 
complement, 
unsigned magnitude, 
and mixed formats 
for both integer and fractional 
numbers. 


Input formats for the multiplicand 
(R) and multiplier 
(S) are given below, followed 
by output formats 
for 
the fully extended product. The fully extended product 
(PRDT) is 67 bits wide. It includes the extended 
product (XTP) bits YETP1-YETPO,the most significant product (MSP) bits Y63-Y32, and the least significant 
product 
(LSP) bits Y31-YO. 


Table 
2 
shows 
the 
output 
formats 
generated 
by two's 
complement, 
unsigned 
and 
mixed-mode 
multiplications. 


Two's Complement 
Unsigned Magnitude 


Two's Complement 
Two's Complement 
Two's Complement 


Unsigned Magnitude 
Two's Complement 
Unsigned Magnitude 


examples 


Representative examples of single-precision multiplication, double-precision multiplication, and division using 
Newton-Raphson 
binary division algorithm 
are given below. 


single-precision 
multiplication 


Microcode for the multiplication of two signed numbers is shown below. In this example, the result is rounded 
and the 32 most significant bits are output on the Y bus. A second instruction (SELY = 0) would be required 
to output 
the least significant 
half if rounding were not used. 


Unsigned and mixed mode single-precision 
multiplication 
are executed using the same code. (The sign 
controls must be modified accordingly.) 
Following are the input and output formats for signed, unsigned, 
and mixed mode operations. 


_231 
230 
229 
(Sign) 


_231 
230 
229 
ISign) 
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Unsigned 
Integer 
Inputs 
Input 
Operand 
A 
Input 
Operand 
B 


31 
30 
29 
2 
0 
31 
30 
29 
2 
0 


231 
230 
229 
.............. 
22 
21 
20 
231 
230 
229 
. .......... 
22 
21 
20 


Two's 
Complement 
Fractional 
Inputs 
Input 
Operand 
A 
Input 
Operand 
B 


I 
31 
30 
29 
2 
0 I 
I 
31 
30 
29 
. . . . . . . . . . . 
2 
0 I 


-20 
2-1 
2-2 
2-29 
2-30 
2-31 
-20 
2-1 
2-2 
2-29 
2-30 
2-31 
(Sign) 
(Sign) 


Unsigned 
Fractional 
Inputs 
Input 
Operand 
A 
Input 
Operand 
B 


31 
30 
29 
2 
0 I 
31 
30 
29 
2 
0 I 


2-1 
2-2 
2-3 ............... 
. 
2-30 
2-31 
2-32 
2-1 
2-2 
2-3 ................ 
2-30 
2-31 
2-32 


Two's 
Complement 
Integer 
Outputs 


Extended 
Product 
Most 
Significant 
Product 
Least Significant 
Product 
(YETP2-YETPO) 
(Y63-Y32) 
(Y31-YO) 


I 
66 
65 
64 I I 63 
62 
61 
34 
33 
32 I I 31 
30 
29 
......... 
2 
0 


_266 
265 
264 
263 
262 
261 
234 
233 
232 
231 
230 
229 
22 
2' 
2° 
'-.,-" 
(Sign1 


Unsigned 
Integer 
Outputs 


Extended 
Product 
Most 
Significant 
Product 
Least Significant 
Product 
(YETP2-YETPO) 
(Y63-Y32) 
(Y31-YO) 


66 
65 
64 I I 63 
62 
61 
34 
33 
32 I I 31 
30 
29 
2 
0 


266 
265 
264 
263 
262 
261 
234 
233 
232 
231 
230 
229 
22 
2' 
2° 


Two's 
Complement 
Fractional 
Outputs 


Extended 
Product 
(YETP2-YETPO) 
Most 
Significant 
Product 
(Y63-Y32) 
Least Significant 
Product 
(Y31-YO) 
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Extended 
Product 


{YETP2-YETPOI 


Most 
Significant 
Product 
(Y63-Y32) 
Least Significant 
Product 


(Y31-YOI 


66 
65 
64 1 
1_63 
__ 
6_2__ 6_1 
34__ 
33__ 
32~1 
1_31 
__ 
30__ 2_9 
2 
0_ 


double-precision 
multiplication 


To simplify 
discussion of double-precision 
multiplication, 
the following 
example implements 
an algorithm 


using one' ACT8836A 
device. It should be noted that even higher speeds can be achieved through 
the 


use of two 
'ACT8836As 
to implement 
a parallel multiplier. 


The example is based on the following 
algorithm 
where A and Bare 
64-bit 
signed numbers. 


Let 
Am = as,a62, a61,' .. 
, a32 
and 
AI = a31' a30, a29, ... 
, ao (ao 
LSB) 


Therefore: 


A = (Am 
x 
232) + AI 


Likewise: 


B = (Bm x 232) + BI 
Thus: 
A x 
B = [(Am 
x 
232) + All 
x 
[(Bm x 
232) + Bil 


= (Am 
x 
Bml 264 + [(Am 
x 
Bil + (AI x 
Bmll232 
+ AI x 
BI 


Therefore, 
four products 
and three summations 
with 
rank adjustments 
are required. 


Basic implementation 
of this algorithm uses a single' ACT8836A. 
The result is a two's 
complement 
128-bit 


product. 
Microcode 
signals to implement 
the algorithm 
are shown 
in Examples 1, 2, and 3. 


The first instruction 
cycle computes 
the first product, 
AI x 
BI. The least significant 
half of the result is 
output 
through 
the Y port for storage in an external 
RAM or some other 32-bit 
register; this will be the 


least significant 
32-bit 
portion 
of the final result. 


The instruction 
also uses the shifter to shift the AI x 
BI product 
32 bits to the right in order to adjust 


for ranking in the next multiplication-addition 
sequence. The least significant half of the shift result is stored 


in the lower 32-bit 
portion of the accumulator; 
the upper 32 bits contain the zero and fill. 


The second instruction 
produces the second product, AI x Bm, adds it to the contents of the accumulator, 


and stores the result in the accumulator 
for use in the third instruction. 


Instruction 
3 computes 
Am 
x 
BI, adds the result to the accumulator, 
and outputs 
the least significant 
32 bits of the addition 
for use as bits 63-32 
of the final product. 


This instruction 
also shifts the result 32 bits to the right to provide the necessary rank adjustment 
and 


stores the shift result (the most significant half of the addition resultl in the lower 32 bits of the accumulator. 
Bits ACC63-ACC32 
are filled with zeros; the sign is extended into the three upper bits (ACC66·ACC64). 


Instruction 
4 computes 
the fourth product 
(Am x Bm), adds it to the accumulator, 
and outputs 
the least 
significant 
half at the Y port for" use as bits 95-64 
of the final product. 


This example assumes that the chip is operating in feed-through mode. A fifth instruction is therefore required 
to perform the fourth 
iteration 
again so that bits 127-96 
of the final product 
can be output. 
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Operand 
Instruction Inputs 


Select 
Product 
Muhiplie,/ 
Register 
Registe, 
Feed· 
Clock 
Eneble. 
Y/PY 
R bus 
Rounding 
Comple· 
Adde, 
O·MUX 
Sign 
Shifl·MUX 
loed 
Write 
through 
Y·MUX 
Output 


S bu. 
Sign 
Control 
ment 
Mode 
S.lect 
Extend 
Control 
Select 
Enable 
Control 
I 
A 
B 
Y 
Select 
Enabla 


EA 
EB 
OASGN 
OBSGN 
RN01 
RNOO 
COMPl 
ACC1 
ACCO 
SElO 
SGNEXT 
SFT1 
SFTO 
SElREG 
iNEH 
WEl 
FT1 
FTO CKEI 
CKEA 
CKEB 
CKEY 
SElY 
OEY 


1 
1 
1 
1 
0 
1 
0 
0 
0 
0 
X 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
0 


Opo,and 
Instruction 
Inputs 


Select 
Product 
Multiplier! 
Register 
Register 
Feed- 
Clock Enables 
Y/PY 


Instruction 
R bus 
Rounding 
Comple· 
Adde, 
O·MUX 
Sign 
Shifl·MUX 
load 
Write 
through 
Y·MUX 
Output 
Number 
S bu. 
Sign 
Control 
ment 
Mode 
Select 
Extend 
Control 
Select 
Enable 
Control 
I 
A 
B 
Y 
Select 
Enable 


EAEB 
OASGN 
OBSGN 
RN01 
RNOO 
COMPl 
ACC1 
ACCO 
SELD 
SGNEXT 
SFT1 
SFTO 
SElREG 
iNEH 
WEl 
FTl 
FTO CKEI 
CKEA 
CKEB 
CKEY 
SElY 
OEY 


111 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
0 
0 
0 
0 
0 
1 
1 
, 
1 
0 
0 


121 
1 
1 
0 
1 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
X 
X 


131 
1 , 
1 
0 
0 
0 
0 
0 
, 
0 
0 
1 
1 
0 
0 
0 
0 
0 
1 
1 
1 
1 
0 
0 


141 
1 
1 
1 
1 
0 
0 
0 
0 
1 
X 
0 
0 
0 
X 
X 
X 
0 
0 
, 
1 
1 
1 
0 
0 


151 
, 
1 
1 
1 
0 
0 
0 
0 
1 
X 
0 
0 
0 
X 
X 
X 
0 
0 
1 
1 
1 
1 
1 
0 


Operand 
Instruction 
Inputs 


Select 
Product 
Multiplierl 
Register 
Register 
Faad- 
Clock Enables 
Y/PY 
Instruction 
R bus 
Rounding 
Comple· 
Adde, 
O·MUX 
Sign 
Shift·MUX 
load 
Write 
through 
Y-MUX 
Output 
Number 
S bu. 
Sign 
Control 
ment 
Mode 
Select 
Extend 
Control 
Select 
Enable 
Control 
I 
A 
B 
Y 
Select 
Enable 


EAEB 
OASGN 
OBSGN 
RN01 
RNOO 
COMPl 
ACC1 
ACCO 
SElO 
SGNEXT 
SFTl 
SFTO 
SElREG 
iNEH 
WEl 
FT1 
FTO CKEI 
CKEA 
CKEB 
CKEY 
SElY 
OEY 


Repeat N Times· 


111 
0 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
1 
0 
1 
0 
0 
0 
0 
0 
1 
0 


121 
0 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
1 
1 
0 
1 
1 
0 
0 
0 
0 
0 
1 
0 


End Repeat 


131 
0 
1 
0 
0_ 
0 
0 
0 
0 
0 
0 
0 
1 
1 
1 
, 
0 
1 
0 
0 
0 
0 
0 
1 
0 


f41 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
0 
1 
1 
0 
0 
0 
0 
0 
1 
0 


SN74ACT8836A 
32·BIT 
BY 32·BIT 
MULTIPLIER/ACCUMULATOR 


Newton-Raphson 
binary division algorithm 


The following 
explanation illustrates how to implement the Newton-Raphson 
binary division algorithm using 
the 'ACT8836A 
multiplier/accumulator. 
The Newton-Raphson 
algorithm 
is an iterative 
procedure 
that 


generates 
the reciprocal 
of the divisor through 
a convergence 
method. 


Consider the equation Q = A/B. This equation can be rewritten 
as Q = A x (1/B). Therefore, 
the quotient 
Q can be computed 
by simply multiplying 
the dividend A by the reciprocal of the divisor (B). Finding the 
divisor 
reciprocal 
l/B 
is the objective 
of the Newton-Raphson 
algorithm. 


To calculate 
1/B the Newton-Raphson 
equation, 
Xi + 1 = Xi(2-BXil 
is calculated 
in an iterative 
process. 
In the equation, 
B represents 
the divisor 
and X represents 
successively 
closer approximations 
to the 
reciprocal 1/B. The following sequence of computation 
illustrates the iterative nature of the Newton-Raphson 
algorithm. 


Step 1 
Step 2 
Step 3 


Step n 


XO(2-BXO) 
X1(2-BX1) 
X2(2-BX2) 


Xn-1 (2-BXn-1 ) 


The successive 
approximation 
of Xi, for all i, approaches the reciprocal 
l/B 
as the number of iterations 
increases; 
that is 


1im Xi = l/B 
i -+ n 


The iterative 
operation 
is executed 
until the desired tolerance 
or error is reached. The required accuracy 
for 1/B can be determined 
by subtracting 
each xi from its corresponding 
xi + 1. If the difference 
IXi + 1 


- 
Xi I is less than or equal to a predetermined 
round off error, then the process is terminated. 
The desired 
tolerance 
can also be achieved by executing 
a fixed number of iterations 
based on the accuracy 
of the 
initial guess of l/B 
stored in RAM of PROM. 


The initial guess, XO, is called the seed approximation. 
The seed must be supplied to the Newton-Raphson 


process externally 
and must fall within 
the range of 0 < XO < 2/B if B is greater than 0 or 2/B < XO < 0 


if B is less than O. 


To perform the Newton-Raphson 
binary division algorithm 
using the' ACT8836A, 
the divisor, 
B, must be 
a positive 
fraction. 
As a positive 
fraction, 
B is limited 
within 
the range of 1/2 
~ B < 
1. 


Since Xi from Newton-Raphson 
must lie between 0 < Xi < 2/B and since the range of the positive fraction 


B is 1/2 ~ B < 
1, then the limits of Xi become 
1 ~ Xi <2. 


The range of 
- BXi will therefore 
be - 2 ~ 
- BXi ~ 
- 1/2. 


The limits of - BXi are shown 
in Table 3 as they would appear in the' ACT8836A 
extended 
bit, binary 
fraction 
format. 


Extended Bits 
63 
62 
61 
...... 
2 
1 
0 
66 
65 
64 


1 
1 
1 
0 
0 
0 
...... 
0 
0 
0 


1 
1 
1 
1 
1 
0 
... .. . 
0 
0 
0 


The diagram indicates 
that 
- BXi is always 
of the form: 


111 
dO. 
dl 
d2 
dn-2 
dn-l 
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The next step in Newton-Raphson 
is to complete the 2 - BXi equation. The fractional representation 
of 2 is: 


001 
0.00 
00 


Extended Bits 
63 
62 
61 
...... 
1 
0 
66 
65 
64 


1 
1 
1 
dO 
dl 
d2 
...... 
dn-2 
dn-l 
+ 
0 
0 
1 
0 
0 
0 
..... 
0 
0 


= 
0 
0 
0 
dO 
dl 
d2 
...... 
dn-2 
dn-l 


Since this step only affects 
the extended bits (66-64) 
on the' ACT8836A, 
this step can be skipped. The 
following 
algorithm can therefore be used to perform Newton-Raphson binary division with the' ACT8836A. 


Assuming 
B is on the DB bus (or stored in the B register) and Xi is stored in the temporary 
register: 


Step 1 
Accumulator 
+- 
- (DB x temporary 
register) 
= 2-BXi 


Step 2 
Temporary 
Register 
+- Left shift one bit of 
(accumulator 
times temporary 
register) 
Xi+ 1 


= Xi (2-BXi) 


Step 3 
Repeat Steps 1 and 2 until IXi + 1 - 
Xi I $ a predetermined 
round-off 
error 


Two cycles are required for each iteration. 
The left shift that is performed in Step 2 is required to realign 
Xi after the signed fraction 
multiply. 
Microcode 
for this example is shown in Examples 1, 2, and 3. 
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Supply voltage, 
Vcc 
(see Note 1) 
-0.5 
V to 6 V 
Input clamp current, 
11K (VI <0 
or VI> VCC) 
± 20 mA 
Output 
clamp current, 
10K (VO < 0 or Vo > VCC) . 
± 50 mA 
Continuous 
output 
current, 
10 (VO = 0 to VCC) . . . . . . . 
± 50 mA 
Continuous 
current through 
VCC or GND pins. 
. . . . . . . . 
± 100 mA 
Operating free-air temperature 
range. 
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
ooC to 70°C 
Storage temperature 
range 
- 65 °C to 150 °C 


'Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" 
is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 


NOTE 1: All voltage levels are with respect to GND. 


MIN 
NOM 
MAX 
UNIT 


Vee 
Supply voltage 
4.75 
5 
5.25 
V 


VIH 
High-level input voltage 
2 
Vee 
V 


VIL 
Low-level input voltage 
a 
0.8 
V 


10H 
High-level output current 
-8 
mA 


10L 
Low-level output current 
8 
mA 


VI 
Input voltage 
a 
Vee 
V 


Vo 
Output voltage 
a 
Vee 
V 


dtldv 
Input transition rise or fall rate 
a 
15 
nslV 


TA 
Operating free-air temperature 
a 
70 
°e 


electrical characteristics over recommended operating free-air temperature range (unless otherwise 
noted) 


PARAMETER 
TEST CONDITIONS 
vCC 
MIN 
TYP* 
MAX 
UNIT 


4.75 V 
4.6 
10H = 
-20 
p.A 
5.25 V 
5.1 


VOH 
V 
4.75 V 
3.85 
3.95 
10H = 
-8 
mA 
5.25 V 
4.60 
4.70 


4.75 V 
0.1 
10L = 20 p.A 
5.25 V 
0.1 


VOL 
V 
4.75 V 
0.32 
0.45 
10L = 8 mA 
5.25 V 
0.32 
0.45 


II§ 
VI = Vee or a 
5.25 V 
±0.1 
±5 
p.A 


Ieco 
VI = Vee or 0.10 
5.25 V 
lOa 
200 
p.A 


e, 
VI = Vee or a 
5V 
10 
pF 


Colee' 


One input at 3.4 V, 


5.25 V 
1 
mA 
other inputs at a or Vee 


*AII typical values are at Vee = 5 V. TA = 25°C. 
§For 1/0 ports. the parameter 10ZH and 10ZL include the offstate output current. 
'This is the increase in supply current for each input that is at one of the specified TTL voltage levels rather than a or Vee. 


TEXAS • 
INSTRUMENTS 


SN74ACT8836A 
32-BIT 
BY 32·BIT 
MULTIPLIER/ACCUMULATOR 


timing requirements over recommended 
ranges of supply voltage and operating free-air temperature 
(unless otherwise 
noted) 


setup and hold times 


PARAMETER 
MIN 
MAX 
UNIT 


tsu1 
Setup time, DA, DB, or I before CLKt' 
10 


tsu2 
Setup time, DA, DB, or I before CLKt, 00 MODE' 
45 


tsu3 
Setup time, CKEA before CLKt 
10 


tsu4 
Setup time, CKEB before CLKt 
10 


tsu5 
Setup time, CKEI before CLKt 
10 


tsu6 
Setup time, CKEY before CLKt 
10 


tsu7 
Setup time, SELREGbefore CLKt 
10 
ns 
tsuB 
Setup time, WEMS before CLKt 
10 


tsu9 
Setup time, WELS before CLKt 
10 


tsu10 
Setup time, SELD before CLKt 
10 


tsu11 
Setup time, EA or EB before CLKt, 11 MODE 
26 


tsu12 
Setup time, EA or EB before CLKt, 10 MODE 
45 


tsu13 
Setup time, EA or EB before CLKt, 01 MODE 
42 


tsu14 
Setup time, EA or EB before CLKt, 00 MODE 
45 


th1 
Hold time, DA, DB, or I after CLKt' 
a 


th2 
Hold time, DA, DB, or I after CLKt, 00 MODE' 
a 


th3 
Hold time, CKEA after CLKt 
a 


th4 
Hold time, CKEB after CLKt 
a 


th5 
Hold time, CKEI after CLKt 
a 


th6 
Hold time, CKEY after CLKt 
a 


th7 
Hold time, SELREGafter CLKt 
a 


Hold time, WEMS after CLKt 
ns 
th8 
a 


th9 
Hold time, WELS after CLKt 
a 


th10 
Hold time, SELD after CLKt 
a 


th11 
Hold time, EA or EB after CLKt 
a 


th12 
Hold time, EA or EB after CLKt 
a 


th13 
Hold time, EA or EB after CLKt 
a 


th14 
Hold time, EA or EB after CLKt 
a 


FT MODE 
MIN (ns) 
COMMENTS 


00 
Without Accumulator 
48 
Equal to tpd 10, see Note 2 


00 
With Accumulator 
45 
Equal to tsu2, see Notes 2 and 3 


01 
Without Accumulator 
45 
see Note 3 


01 
With Accumulator 
45 
see Note 3 


10 
Without Accumulator 
48 
see Note 3 


10 
With Accumulator 
48 
see Note 3 


11 
Without Accumulator 
30 
see Note 3 


11 
With Accumulator 
30 
see Note 3 


NOTES: 2. The FTA mode must include tsu2 because the set-up time is longer than the CLK to Y propagation delay. 
3. All cycle times shouid include tpd 13 if all 64 bits of result are to be obtained in a single cycle. 


PARAMETER 
MIN 
MAX 
UNIT 
I CLK high 
10 
tw 
Pulse duration 
I CLK low 
ns 
15 
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switching characteristics over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise 
noted) t 


FROM 
TO 
PARAMETER 
FT MODE 
MIN 
MAX 
UNIT 
(INPUT) 
(OUTPUT) 


tod1 
Y Reg 
01 or 11 
15 


tod2 
A, B, or I Reg 
Y 
10 
45 


tod3 
Accumulator 
00 
38 


tod4 
Y Reg 
01 or 11 
15 


tod5 
A, B, or I Reg 
YETP 
10 
45 


tod6 
Accumulator 
00 
38 


tnd7 
Y Reg 
01 or 11 
15 


tod8 
A, B, or I Reg 
ETPERR 
10 
45 


tod9 
Accumulator 
00 
38 


tod10 
Y 
00 
48 


tnd11 
DATA, I 
YETP 
00 
48 


ETPERR 
00 
ns 
tod12 
48 


tod13 
SELY 
Y 
All modes 
12 


tod14 
DA 
All modes 
16 
PERRA 
tod15 
PA 
All modes 
16 


tod16 
DB 
All modes 
16 
PERRB 
tod17 
PB 
All modes 
16 


tod18 
Y 
MSERR 
All modes 
16 


tod19 
PY 
PERRY 
All modes 
16 


tod20 
YETP 
MSERR 
All modes 
16 


tod21 
TPO-1 
All outputs 
All modes 
48 


ten 
OEY 
All modes 
15 
ns 
Y, YETP 
!dis 
All modes 
13 
ns 
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TIMING 
CLOADt 
IOL 
IOH 
VLOAD 
PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


tpZH 
0 
ten 
tpZL 
50 
8 
-8 
3 


tdls 
tPHZ 
-8 
tpLZ 
50 
8 
1.5 


tod 
50 
8 
-8 
* 
t CLOAD includes probes and test fixture capacitance. 
* VLOAD - VOL = 50 n, where VOL = 0.6 V, IOL = 8 mA. 


IOL 


Timing 
Input 
(See Note A) 


14- tf 


VOLTAGE WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


------ 
---- 
3V 
Input 
L 1.5V 
~ 
1.5V 
(See Note A) -.Ii 
. 
1\... 
0 V 


tpd 
14 
~I 
~ 
tpd 
I Y 
I 
1-- 
VOH 
In-Phase 
I 
1.5 V I 
\L ~.5 V 
Output 
. 
'\..::. 
--1--' 
I 
VOL 
I~ 
~, 
i+---+l- 
tpd 
tpd~ 
1 


Out-of-Phase 
\L 
!C 
VOH 
Output 
" 
1.5V 
T 1.5V 
_.----.- 
- 
- 
VOL 


~ 


--3V 


1.5V 
1.5V 
I 
I 
OV 
*--tw-.l 
I 
I 


~ 


I 
3V 


1.5V 
1.5V 
____ 
OV 


VOLTAGE WAVEFORMS 
PULSE DURATION 


Output 
Control 
(low-level 
enabling) 


1.5 V 


~---- 
OV 
tpZL --.J 
14- 
I 
I 
I 
tpLZ~ 
I+- 


__ 
I,...,•..•' 
U 
3V 


1 'J 
I 
1....---1.5V 


1 
\ 
1.5V 
1 L~L~O-=VVOL 


1 
. 
tPHZ-+! 
I+- 


tPZH -+j 
I +- 
1 ..i 
t 
\- 
VOH-= 0:3 ~OH 
Waveform 
2 
1 5 V 
~ 
(See Note C) 
~ 
. ; ~.5 V 


Waveform 
1 
(See Note B) 


Notes: 
A. Phase relationships between waveforms were chosen arbitrarily. All input pulses are supplied by pulse generators having the following 
characteristics: 
PRR = 1 MHz, Zo = 50 n, tr" 
6 ns, tf" 
6 ns. 


B. Waveform 
1 is for an output with internal conditions such that the output is low except when disabled by the output control. 


C. Waveform 2 is for an output with internal conditions such that the output is high except when disabled by the output control. For tpLZ 
and tpHZ, VOL and VOH are specified values. 
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CKEA. 
CKEB_~ 
_ 


CKE!. 
CKEY 
2Z(' 
I 
INSTR 
zzx:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::=x:::= 


I 
I 
I 
I 
DATAZZX:::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::=x:::= 


1 


1 
I 


1 


1 
SELY 
_-" 
-( 


1 
I 
I 
I 
I 
I 
OEYSSSSS\SSSSSSSSSSS\SSS\SSS\SSSS\SSSSSSSS 
~ 
I 
«ZZ7l 


I 
I 
I 
I 
I 
~ 
ten 
I 
tdis -;......: 
I. 
tpdl0 
"t 
~ 
tpd13 
I 
Y31·YO 
SSSSSSSSSSSSSSSSSS\SSSSSSSS\SSSSSSSSSSSSSSSX---L-SP-~X~--MS~P--XS 


CLK 
r----l .. 
_ 


1 


1 
CKEA. 
CKEB 
I 
CKEI. 
CKEY 
Zf 
I 
I 
I 
INSTR2!< 
I 
I 
I 
I 
I 
I 
I 
DATA2IC 
I 


I 
I 


I 
I 
I 
I 


I 
I 
~~::..~~~~ 
S\sssssssssssssssssssssssssssssssssss~: 
41ZZZZZZZZZZ 


: 
tSU7"tSU10~: 
I 
~th7"th9~ 


: 


I 
ACCUMULATOR_. 
X~---~SU=M~.O=F ...,=OO=U=CT----- 


I 
1 


I. 
tsu2 
., 


1 
1 
SELYI 
I 
~ 


1 
I 
: 


I 
I 
1- 
I 
I 
r- 
i 
I 
tpd13~ 
tdis~ 
J. 
tpdl0 
1 
.: r-tpd3: 
: 
Y31·YO 
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSI\SSX 
LSP 
X---M-S-P--xs: 
ten~ 
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ClK 
..••~~ 
~\_ 
_ 


I 
I 
I 
I 


CKEA. CKEB V\h 
I 
: 


CKE!. CKEY 
~tsU3.tsu6-.: 
I. 


INSTR~: 
~; 
_tsu1_ 
I 
: 
iol 
th1---~ 


DATA ::::K 
A '. 
8 1 
JI(~::::::~::::::::::::::::::::::::::::::= 


4?ZZZZZZZZZZZZZZZZZZZZ( 


01 th3·th6 


1 
SElY 
I 
A' 


I 
I 


1 
I 
I 
lID' 
" 
: 
I 
",.,..- 
: 
I+----*-tpd1: 
tdis...J+---.l 


: 
: 
tpd13-14---+j 
: 


Y31·YO ~s~,~S-S-S~S~S~S-S-S-S-S~S~S-S-S-S~S~S~S-S-S~S~S~S-S-S-S~S~S~S~S-S-S~S~S~S~S-S 
•...... 
x 
esp, 
X~--=M=SP~1--~X:S 
I 
, 
ten-!4--------+1 


: 
I 
1 


CKEA. CKEB ~ 
I 
: 
: 
~ 


CKE!. CKEY 
--: 
~tSU3.tSU6 
: 
: 
th3-th6~:.~--~ 


INSTR 
~ 
: 
>;C~:""'----~X: 
X: 
X~::======= 


---., 
r-- tsu1 
1 
I 
I 
1 
I 


1 
_th1-----., 
I 
1 
1 
I 


' 
1 
I 
I 
1 
I 


DATA 
~ 
A1.81 
X: 
A2.82 
X: 
A3.83 
X: 
A4 84 
X~.,...: 
-A=S~85~- 
~~::.~:~~ 
~~ 
.: 
~ 
: 
/ 
: 
: 


su7' su10 i..--..Lth7.th9 
i 
i 
: 


ACCUMULATOR 
~ff+!X 
SUM0' PRODUCT': 
X 
SUMO' PRODUCT2: 
X 
: 


I 
I 
I 
I 


YREG ~?/-~?-?~?~?/-~Z-%~Z~?'/~~~?-?~?~;//~%-?~?~?/-~z"'~~jX 
PRODUCT1 
: X 
PRODUCT2 
: X 
PRODUCT3 
: X~ 
_ 


I 
I 
I 
I 


SElY 
'--,( 
''l-' 
/~---;-,----- 


I 
I 
I 
I 
I 
I 


I 
I 
I 
1/ 
" 
I 
')J 
I 
'I 
'I 
I 


II 
I 
t 
I 
" 
I 
tpd1~ 
1 
Pd1...--.; 
_tpd13 
I 


ten----!...--..i ---.+-- tpd13 
1 
I 
--.I 14-- tdis 
I 


"" 
I 
I 
I 


Y31·YO 
-z...:-?-?-%-?/--?-?-?-?/--?-?-?-?/--Z-?-?-?/--Z-?-%-~--M-SP-' -~'?////ZWIZ/M7ff//Z/dZ/Z' 
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CLK 
~,---,~ 
_ 


1I. 
CKEA. CKEB~ 
: 


CKEI 
1 
1 


~tsu3-tsu5 
--- 
INSTR::jc 
: 
I 
I 


I'" 
tsu1 
-I 
I 
-IO------th1 


DATA~ 
: 


I 
: 
~ 


1 
I 
: 
" 
: 
r- 
: 
4ten 
:'---"tpd13 
: 


,. 
tpd2-----: 
: 
tdiS---: 


Y31-YO~/>I<---l-S-P---)I(~--M~S~P--~~ 


~ 
th3-th5 
~/A 


>,:---------- 
I 
I 


·1*~--------- 


, 
, 
, 
--.I ,...tsu3-tsu5 
I 
I 
th3-th5 __ 


CKEA. CKEB~ 
I 
::: 
k 


CKEI 
-: 
i 
1 
i- 


I 
I 
1 
I 


INSTR~ 
: 
11 
~ 
: 
12 
X : 
'3 
X : 
14 
X : 
IS 


_ 
I+-tsu1 
I 
I 
I 
I 
1 


1 
~th1-----" 
I 
I 
I 
I 


I I 
I I 
1 
I 
I 


DATA z:2i< : 
A'," 
X I 
A2,'2 
)I( : 
A3,'3 
X 
: 
A4,'4 
X~~: 
-A-S,-'-S-- 


SELREG.SELD07~/'2A: 
D~~~ 


WEMS. WELS 
I t 
t 
~ 
I 
I 
1 
I 
I 
su7-su9~ 
1 
I 
I 
I 


1 
~th7'th9_ 
1 
I 
I 


ACCUMULATOR07~0 
'UM-O'-PROOUCT' 
: 
: 
: 


I 
1 
I 
I 
I 
1 
l, 
I 
1 
I 


SEL Y 
I 
A'~-----<I1'------/ 
I 
1 
I 


I 
I 
II 
I 
1 
I 


OEY~ 
I 
I 
II 
), 
I 
I 
I:: 
II 
t 
I: 
: 
: 
ten--i+--i---l 
-'tpd13 
~ 
l+ pd13 
~tdis 
I 
I 


tpd2~ 
: 
jitpd2 
: 
: 
: 
I 


Y31-YO0V~ 
M.P, 1 
~ 
M.P2: m000'////~ft/Z//Z0V/ 
I 
I 
'I 
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~-lsu3-lsu6 
CKEA. CKEBm. 
I 


CKEI. CKEY 
: 


INSTR =:¥ : 
x~-,-__ ~x,-+ 
~x,-+ 
~x,-+ 
~x~-+- 
x~ 
_ 
Isu1+! 
14- 


I 
--lh1~ 
DATA =:)1<: 
A'. Bl 
*--A-2.-B-2--X 


I 


PIPE ~:::::Lm~:::::L2Z~:zj~C]IN~T~R !!PR~oiEoI' 
t=:C]IN1!T~R 
!!PR~O!§DI2 
t=CJIN~TER~PR~O~D~3t=>CJIN~TER~P~RO~D~2~>C~IN~TR~PR@O~D3rt==== 


I 
I 
I 
I 
I 
..L 
YREG 7/o/~~ffiX 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
OEY -------------)J 
.(~~I--------- 


I 
I 
len~ 
-.: 
~lpd13 
I 
I' 
I 
: 
I 
I 
I 
I d1~ 
I 
I 
dls~ 
I 


~~.L 
•.••.•.•~~~.1. 
••.•.•..~~P~~1 
I ~~ 
..••• 
.L~~~ 
Y31-Y0«?7~~/C 
LSP, 
MSPl 
LSP2 
MSP2 
LSP3 
MSP3 ~A0Z'Z'Z' 


I 
! 
I 
I 
I 
I 


I 
I 
I 
I 


CKEA. CKEB 
~ 
Isu3-lsu6 
: 
: 
Ih3-lh6 ~ 
CKEI. CKEY m. 
I 
I 
I 
: 
J0Z:Z 


I 
I 
I 
I 


INSTR =:¥ : 
~ : 
X 
~X~~_~X~~_~X_: 
__ 
~X 
: 
Isu1+! 
14- 
I 
I 
I 
I 


~lh1_ 
I 
I 
I 


DATA =::)<: 
A1.Bl 
X: 
A2.B2 
X.•..•.~A~3.~B~3-~X~::=======~X::::::=======~X~:---~X::::::======== 


I 
I 
I 
I 


PIPE %W~~ 
PP' 
X 
PP2 
X 
PP3 
X 
PP. 
: X 
PP5 
: ~ 


I 
I 
~lh7-lh9 
I 
I 
I 
I 
~~~ES~'~:~~ 
mz;f~rnw/;t-Lsu~ 
i 
i 
ACCUMULATORw;)~f'Yava:'Z/ffp 
Pl 
X 
P2 
i X 
±PW3 i X 
±P2±P'i 
:>@§±P5 


YREG w;)~~ffp 
P1 
X 
P2 
! X 
±Pl±P3 i X 
±P2±P.! 
:>@§±P5 


I 
I 
I 
I 
L 
...L 
SELY 
I 
I 
I 
.(-----).. 
I ~//0000V;1W2//0'Z 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
: 
I 
I 
I 
I 
I 
11 
I 
I 
:-f 
I 
I 
I 
I 
len~ 
-.I ,.tpd13 
I 
-41 I4-ldis 
I 
I 
i 
I 
I 
I 
I 
I 
-14--+4 
I 
I 
I 
I 


I 
I 
Ipd1~ 
I 
pd13 
I 
I 
I 
I 
I 


Y31-Y0«?7~Avzav;0vff_~~ 
I 
I 
I 
I 
I 
I 
I 
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• 
High-Speed Programmable 
Switch 
for 
Parallel Processing Applications 


• 
Dynamically 
Reconfigurable 
for Fault- 
Tolerant 
Routing 


• 
64 Bidirectional 
Data I/Os in 16 Nibble 
Groups of Four Bits Each 


• 
Data I/O Selection 
Programmable 
by Nibble 


• 
Eight Banks of Control Flip-Flops for Storing 
Configuration 
Programs 


• 
Two Selectable 
Hard-Wired Switching 
Configurations 


• 
Selectable 
Stored-Data 
or Real-Time Inputs 


• 
156-Pin Grid-Array 
Package 


• 
Low-Power 
1-pm EPIC'· CMOS Process 


• 
Single 5-V Power Supply 


description 


The SN74ACT8841 
A is a flexible, 
high-speed digital crossbar switch. 
It is easily microprogrammable 
to 
support user-definable interconnection 
patterns. This crossbar switch is especially suited to multiprocessor 


interconnects 
that are dynamically 
reconfigurable 
or even reprogrammable 
after each system clock. The 
'ACT8841 A is built in Texas Instruments 
advanced 
1-pm EPIC'" CMOS process to enhance performance 
and reduce power consumption. 
The switch 
requires only a 5-V power supply. 


SN74ACT8841A 


DIGITAL CROSSBAR SWITCH 


15 x 15 GC PACKAGE 


(TOP VIEW) 


Because the' ACT8841 A is a 16-port device, system architectures 
based on the' ACT8841 A can include 
up to 16 switching 
nodes, which may be processors, data memories, or bus interfaces. 
Larger processor 
arrays can be built with multistage interconnection 
schemes. Most applications will use the crossbar switch 
as a broadband 
bus interface 
controller, 
for example, 
between 
closely 
coupled 
processors 
that 
must 
exchange data with very low propagation 
delays. 


The' ACT8841 A has ten selectable control sources, including eight banks of programmable 
control flip- 
flops and two 
hard-wired 
control circuits. 
The device can switch 
from 
1 to 16 nibbles (4 to 64 bits) of 
data in a single cycle. 


The 64 I/O pins of the' ACT8841 A are arranged in 16 switchable 
nibbles (see Figure 1). A single input 
nibble can be broadcast to any combination 
of 15 output 
nibbles, or even to 16 nibbles (including itself) 
if operating off registered data. Multiple input nibbles can be switched 
to multiple outputs, 
depending on 
the programmed 
configurations 
available in the control 
flip-flops. 


The digital crossbar switch 
is intended primarily for multiprocessor 
interconnection 
and parallel processing 
applications. 
The device 
can be used to select 
and transfer 
data from 
multiple 
sources 
to multiple 
destinations. 
Since it can be dynamically 
repr.ogrammed, it is suitable for use in reconfigurable 
networks 
for fault-tolerant 
routing. 


The' ACT8841 A is characterized 
for operation 
from OOC to 70°C. 


PRODUCTION DATA documents 
contain 
information 
current 
as of publication 
date. Products 
conform 
to 
specifications 
per the terms 
of Texes 
Instruments 
standard warranty. Production processing does not 
necessarily 
include 
testing 
of all parameters. 


TEXAS 
~ 
INSTRUMENTS 


SN74ACT8841A 
DIGITAL 
CROSSBAR SWITCH 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 


A1 
GND 
C10 
D31 
H12 
VCC 
N7 
CNTR13 


A2 
GND 
C11 
OED6 
H13 
LSCLK 
N8 
CREADO 


A3 
D37 
C12 
VCC 
H14 
SELDLS 
N9 
VCC 


A4 
D35 
C13 
GND 
H15 
CNTR3 
Nl0 
DO 


A5 
D33 
C14 
D23 
J1 
OEC 
N11 
D3 
A6 
WE 
C15 
D21 
J2 
CRWRITEO 
N12 
D6 


A7 
CRADR1 
Dl 
D43 
J3 
CRWRITE1 
N13 
GND 


A8 
CNTR7 
D2 
D42 
J4 
GND 
N14 
D8 


A9 
CNTR4 
D3 
VCC 
J12 
GND 
N15 
D9 


A10 
OED7 
D7 
GND 
J13 
CNTR2 
P1 
GND 


A11 
D29 
D8 
VCC 
J14 
CNTR1 
P2 
GND 


A12 
D27 
D9 
GND 
J15 
CNTRO 
P3 
D56 


A13 
D25 
D13 
D22 
K1 
CRWRITE2 
P4 
D58 


A14 
GND 
D14 
D20 
K2 
OED12 
P5 
D60 


A15 
GND 
D15 
D19 
K3 
D48 
P6 
D62 
B1 
GND 
El 
D45 
K13 
D15 
P7 
CNTR12 


B2 
GND 
E2 
D44 
K14 
D14 
P8 
CNTR15 


B3 
D39 
E3 
OED10 
K15 
OED3 
P9 
TPO 
B4 
D36 
E13 
OED5 
L1 
D49 
P10 
OEDO 


85 
D34 
E14 
D18 
L2 
D50 
P11 
D2 


B6 
OED8 
E15 
D17 
L3 
OED13 
P12 
D4 
B7 
CRADRO 
F1 
OED11 
L13 
OED2 
P13 
D7 
B8 
CRSRCE 
F2 
D46 
L14 
D12 
P14 
GND 
B9 
CNTR5 
F3 
D47 
L15 
D13 
P15 
GND 
B10 
D30 
F13 
D16 
M1 
D51 
R1 
GND 
B11 
D28 
F14 
OED4 
M2 
D52 
R2 
GND 
B12 
D26 
F15 
CRSEL3 
M3 
D54 
R3 
D57 
B13 
D24 
G1 
CNTR8 
M7 
GND 
R4 
D59 
B14 
GND 
G2 
CNTR9 
M8 
VCC 
R5 
D61 
B15 
GND 
G3 
CNTR10 
M10 
GND 
R6 
OED15 


C1 
D41 
G4 
GND 
M13 
VCC 
R7 
CNTR14 
C2 
D40 
G12 
GND 
M14 
D10 
R8 
CREAD1 


C3 
GND 
G13 
CRSEL2 
M15 
D11 
R9 
CREAD2 


C4 
D38 
G14 
CRSEL1 
N1 
D53 
R10 
TP1 


C5 
OED9 
G15 
CRSELO 
N2 
D55 
R11 
D1 
C6 
D32 
H1 
CNTR11 
N3 
GND 
R12 
OED1 


C7 
VCC 
H2 
SELDMS 
N4 
VCC 
R13 
D5 
C8 
CRCLK 
H3 
MSCLK 
N5 
OED14 
R14 
GND 
C9 
CNTR6 
H4 
VCC 
N6 
D63 
R15 
GND 
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CRCLK 
CRSELO 


CRSEL1 
SELECT 
READ 
CRSEL2 
CONTROL 
CRSEL3 
CONTROL 
REGISTER 
REGISTER 
ADDRESS I 


--.-CRAORO 
\l 
LOAD 
-+-CRAOR1 


TEST I 
TPO 


TP1 


CLK 
ILSH 
MSHI 
CLK 
MSCLK 


SELECT 
SELOMS 


2 


oE03 
3 
015·012 


LSH 


oE04 
DATA 


019·016 
4 


oED5 


023·020 
5 


oE06 


6 


oED7 


031·028 


CRCLK 


oEC 


CNTR3·CNTRO 


CNTR7·CNTR4 


CNTR11·CNTR8 


CNTR 15·CNTR 12 


LSCLK 


SELOLS 


<I> 


DIGITAL 
CRoSS8AR 
SWITCH 


ACT8841 
A 
WRITE EN 


REG 8ANKI 


DESTIN A liON I 


DATA 
CNTR 


_WE 


___+_-CREAOO 


CREA01 
___ 
CREA02 


___ 
CRWRITEO 


___ 
CRWRITE1 


___ 
CRWRITE2 


CRSRCE 


DE08 
8 
035·032 


oE09 
9 
039·036 


oE010 
10 
043·040 


oE011 
11 
047·044 


MSH 
12 
051·048 


DE013 
13 
055·052 


oE014 
14 
059·056 


oED15 
15 
063·060 
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SElDMS 


MSClK 
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PIN 
DESCRIPTION 
110 
NAME 
NO. 


CNTRO 
J15 


CNTR1 
J14 


CNTR2 
J13 


CNTR3 
H15 


CNTR4 
A9 


CNTR5 
B9 


CNTR6 
C9 


CNTR7 
A8 
Control I/O. Inputs four control words to the control flip-flops 
on each CRCLK cycle. As outputs, 
the 
I/O 
CNTR8 
G1 
same addresses can be used to read the flip-flop 
settings. 


CNTR9 
G2 


CNTR10 
G3 


CNTR11 
H1 


CNTR12 
P7 


CNTR13 
N7 


CNTR14 
R7 


CNTR15 
P8 


CRADRO 
B7 
Control register address. Selects 16-bits of control flip-flops as a source/destination 
for outputs/inputs 
I 
CRADR1 
A7 
on CNTRO-CNTR15. (see Table 7) 


CRCLK 
C8 
I 
Control register clock. Clocks CNTRO-CNTR15 into the control flip-flops 
on low-to-high 
transition. 


CREADO 
N8 


CREAD1 
R8 
I 
Selects one of eight banks of control flip-flops to read out on CNTRO-CNTR15 in 16-bit words 


CREAD2 
R9 


·addressed by CRADR1-CRADRO. 


CRSELO 
G15 


CRSEL1 
G14 
I 
Selects one of ten control configurations. 


CRSEL2 
G13 


CRSEL3 
F15 


CRSRCE 
B8 
I 
Load source select. When low selects CNTR inputs, when high selects DATA inputs. 


CRWRITEO 
J2 


CRWRITE1 
J3 
I 
Destination select. Selects one of eight control banks. (see Table 41 


CRWRITE2 
K1 


DO 
N10 


01 
R11 


02 
P11 


03 
N11 


04 
P12 


05 
R13 


06 
N12 


07 
P13 


08 
N14 


09 
N15 
I/O 
I/O data bits 0 through 
17 (data bits 0 through 31 are the least significant 
half). 


010 
M14 


011 
M15 


012 
L14 


013 
L15 


014 
K14 


015 
K13 


016 
F13 


017 
E15 
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PIN 
DESCRIPTION 
1/0 
NAME 
NO. 


D18 
E14 
D19 
D15 
D20 
D14 
D21 
C15 
D22 
D13 
D23 
C14 
D24 
B13 
110 
110 data bits 18 through 31 (data bits 0 through 31 are the least significant half). 
D25 
A13 
D26 
B12 
D27 
A12 
D28 
Bll 
D29 
All 
D30 
Bl0 
D31 
Cl0 
D32 
C6 
D33 
A5 
D34 
B5 
D35 
A4 
D36 
B4 
D37 
A3 
D38 
C4 
D39 
B3 
D40 
C2 
D41 
Cl 


D42 
D2 
D43 
Dl 
D44 
E2 
D45 
El 


D46 
F2 
D47 
F3 
D48 
K3 
D49 
110 
I/Odata bits 32 through 63 (data bits 32 through 63 are the most significant half). 
L1 
D50 
L2 
D51 
Ml 
D52 
M2 
D53 
Nl 
D54 
M3 
D55 
N2 
D56 
P3 
D57 
R3 
D58 
P4 
D59 
R4 
D60 
P5 
D61 
R5 
D62 
P6 
D63 
N6 
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PIN 
I/O 
DESCRIPTION 


NAME 
NO. 


GND 
Al 


GND 
A2 


GND 
A14 


GND 
A15 


GND 
Bl 


GND 
B2 


GND 
B14 


GND 
B15 


GND 
C3 


GND 
C13 


GND 
D7 


GND 
DS 


GND 
G4 


GND 
G12 


GND 
J4 
Ground (all pins must be used). 


GND 
J12 


GND 
M7 


GND 
Ml0 


GND 
N3 


GND 
N13 


GND 
Pl 


GND 
P2 


GND 
P14 


GND 
P15 


GND 
Rl 


GND 
R2 


GND 
R14 


GND 
R15 


LSCLCK 
H13 
I 
Clocks the least significant 
half of data inputs into the input registers on a low-to-high 
transition. 


MSCLK 
H3 
I 
Clocks the most significant 
half of data inputs into the input registers on a low-to-high 
transition. 


OEC 
Jl 
I 
Output enable for control flip-flops, 
active low 


OEDO 
Pl0 


OEDl 
R12 


OED2 
L13 


OED3 
K15 


OED4 
F14 


0ED5 
E13 


OED6 
Cll 


OED7 
Al0 


OED8 
I 
Output enables for data nibbles, active low 


B6 


OEDS 
C5 


OED10 
E3 


OEDll 
Fl 


0ED12 
K2 


OED13 
L3 


OED14 
N5 


OED15 
R6 
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PIN 
DESCRIPTION 
1/0 
NAME 
NO. 


SELDLS 
H14 
I 
When low, selects the stored, least significant 
data input to the main internal bus. When high, real- 


time data is selected. 


SELDMS 
H2 
I 
When low, selects the stored, most significant 
data input to the main internal bus. When high, real- 


time data is selected. 


TPQ 
P9 
Test pins. High during normal operation. (See Table 6). 
I 
TP1 
R1Q 


VCC 
C7 


VCC 
C12 


VCC 
D3 


VCC 
D8 


VCC 
H4 


VCC 
H12 
5-V supply 


VCC 
M8 


VCC 
M13 


VCC 
N4 


VCC 
N9 


WE 
A6 
I 
Write enable for control flip-flops, 
active low 
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overview 


The 64 I/O pins of the' ACT8841 A are arranged in 16 nibble groups of four bits each, where each set 
of four pins serves as bidirectional 
inputs to and outputs 
from a nibble multiplexer. 
Ouring a switching 
operation, 
each nibble passes four bits of either stored or real-time data to the main internal 64-bit 
data 
bus. Each output 
multiplexer 
will independently 
select one of the 16 nibbles from this 64-bit 
data bus. 


Oata nibbles are organized into two groups: the least significant 
half (031-00) 
and the most significant 
half (063-032). 
Stored versus real-time data inputs can be selected separately for the LSH and the MSH. 
Two clock inputs, LSCLK and MSCLK, are available to latch LSH and MSH data inputs, respectively, 
into 
the data register. 


The pattern of output nibbles resulting from the switching 
operation is determined by a selectable control 
source, either one of eight banks of programmable 
control flip-flops 
or one of two 
hard-wired 
switching 
configurations. 
Inputs to the control flip-flops 
can be loaded either from the data bus or from controll/Os. 


A separate clock (CRCLK) is provided for loading the banks of control 
flip-flops. 


architecture 


The' ACT8841 A digital crossbar switch 
has its 64 data I/Os arranged in 16 multiplexer 
logic blocks, as 
shown in the functional 
block diagram. Each nibble multiplexer 
logic block handles four bits of real-time 
input and four bits of stored-data 
input, and either input can be passed to the common 
data bus. 


Two input multiplexer controls are provided to select between stored and real-time input's. SELOLS controls 
input data selection for the LSH (031-00) 
of the 64-bit data input, and SELOMS for the MSH (063-032). 


The input register clocks, 
LSCLK and MSCLK, are grouped in the same way and are used to clock data 
into the registers in the multiplexer 
logic blocks. The 16 data input nibbles make up the 64 data bits on 
the internal main bus. 


This common bus supplies 16 data nibbles to a 16-to-1 output multiplexer 
in each multiplexer 
logic block 
(see Figure 3). As determined 
by one of ten selectable 
control 
sources, the 16-to-1 
output 
multiplexer 
selects a data nibble to send to the outputs 
via the three-state 
output 
driver. 


Control of the input and output multiplexers 
determines the input-to-output 
pattern for the entire crossbar 
switch. 
Many 
different 
switching 
combinations 
can be set up by programming 
the 
control 
flip-flop 
configurations 
to determine the outputs 
from the 16-to-1 
multiplexers. 


For example, the switch can be programmed to broadcast one data input nibble through the other 15 nibbles 
(60 outputs). 
Conversely, 
a 15-to-1 nibble multiplexer 
can be configured 
by programming 
the switch 
to 
select and output a single data nibble from the 64-bit bus. Several examples are described in more detail 
in a later section. 
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multiplexer 
logic group 


There are 16 multiplexer 
logic blocks, 
one for each nibble. External data flows 
from four data I/O pins 
into a logic block. A block diagram of the multiplexer 
logic is shown in Figure 1. The data inputs are either 
clocked 
into the data register or passed directly 
to the main internal 
bus. The 64 bits of data from the 
main bus are presented to a 16-to-1 
multiplexer, 
which 
selects the data nibble output. 


Each of the 16 nibble multiplexer 
logic blocks contains 
eight control 
flip-flop 
(CF) groups, 
one for each 
of the control banks. A control bank stores one complete switching 
configuration. 
Each CF group consists 
of four Ootype edge-triggered 
flip-flops. 
In Figure 1, the CF groups are shown as CFXXO to CFXX7, where 
XX indicates the number of the nibble multiplexer 
logic group (0 ~ XX ~ 
15). CFXXO represents the 16 
CF groups (one from each logic block) that make up flip-flop 
control 
bank 0, CFXX 1 the 16 CF groups 
in bank 1, etc. 


In addition 
to the eight banks of programmable 
flip-flops, 
two 
hard-wired 
switching 
configurations 
can 
be selected. 
The MSH/LSH exchange directs the input nibbles from each half of the switch 
to the data 
outputs 
directly 
opposite. 
This switching 
pattern 
is shown in Table 1 below. 
For example, 
data input on 
011-08 
is output 
on 043-040, 
and data input on 043-040 
is output 
on 011-08. 


lSH 
MSH 
03-00 - 
035-032 
07-04 - 
039-036 


011-08 - 
043-040 
015-012 - 
047-044 


019-016 - 
051-048 


023-020 - 
055-052 
027-024 - 
059-056 


031-028 - 
063-060 


The second hard-wired 
configuration, 
a read-back function, 
causes all 64 bits to be output 
on the same 
I/Os on which they were input. Neither of the hard-wired 
control 
configurations 
affects 
the contents 
of 
the control 
banks. 


The control 
source select, CRSEL3-CRSELO, determines 
which 
switching 
pattern 
is selected, 
as shown 
in Table 2. 


CRSEl3 
CRSEl2 
CRSEl1 
CRSElO 
CONTROLSOURCE SELECTED 


l 
l 
l 
l 
Control bank 0 (programmable) 


l 
l 
l 
H 
Control bank 1 
(programmable) 


L 
l 
H 
L 
Control bank 2 
(programmable) 


L 
L 
H 
H 
Control bank 3 
(programmable) 


L 
H 
L 
L 
Control bank 4 
(programmable) 


L 
H 
L 
H 
Control bank 5 
(programmable) 


L 
H 
H 
L 
Control bank 6 
(programmable) 


L 
H 
H 
H 
Control bank 7 
(programmable) 


H 
X 
X 
L 
MSH/LSH exchange" 


H 
X 
X 
H 
Read-back (output echoes input)" 
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control words 


A CF group can store a four-bit 
control word (CFN3-CFNOI to select the output 
of the 16-to-1 
multiplexer 
for that nibble port. One control word is loaded in each CF group. A total of 16 words. 
one per multiplexer 
logic block. are loaded in a bank to configure 
one complete 
switching 
pattern. 
Table 3 lists the control 
words 
and the input data each selects. 


Each control 
word can be stored in a CF group and sent as an internal control 
signal to select the output 
of a 16-to-1 multiplexer 
in a nibble logic block. For example. any CF group loaded with the word "LHHH" 
will select the data input on 031-028 
as the outputs 
of the associated 
nibble. If all 16 CF groups in a 
bank were loaded with 
"LHHH." 
the same output 
(031-028) 
would 
be selected 
by the entire switch. 


INTERNAL SIGNALS 
INPUT DATA SELECTED AS 


CFN3 
CFN2 
CFN1 
CFNO 
MULTIPLEXER OUTPUT 


L 
L 
L 
L 
03-00 


L 
L 
L 
H 
07-04 


L 
L 
H 
L 
011-08 


L 
L 
H 
H 
015-012 


L 
H 
L 
L 
019-016 


L 
H 
L 
H 
023-020 


L 
H 
H 
L 
027-024 


L 
H 
H 
H 
031-028 


H 
L 
L 
L 
035-032 


H 
L 
L 
H 
039-036 


H 
L 
H 
L 
043-040 


H 
L 
H 
H 
047-044 


H 
H 
L 
L 
051-048 


H 
H 
L 
H 
055-052 


H 
H 
H 
L 
059-056 


H 
H 
H 
H 
063-060 


loading control configurations 


CRWRITE2-CRWRITEO select which 
control 
bank is being loaded. as shown 
in Table 4. 


CRWRITE2 
CRWRITE1 
CRWRITEO 
DESTINATION 


L 
L 
L 
Control bank 0 


L 
L 
H 
Control bank 1 


L 
H 
L 
Control bank 2 


L 
H 
H 
Control bank 3 


H 
L 
L 
Control bank 4 


H 
L 
H 
Control bank 5 


H 
H 
L 
Control bank 6 


H 
H 
H 
Control bank 7 
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The control words for a bank can be loaded either 16 bits at a time on the control 110 pins (CNTR 15-CNTRO) 
or all 64 bits at once on the data inputs (063-00). 
If the control load source select, CRSRCE, is high, the 
words 
are loaded from the data inputs. 
When CRSRCE = L, the CNTR inputs are used. 


When a control bank is loaded from the data inputs, WE, CRSRCE, CRWRITE2-CRWRITEO, and the control 
register clock CRCLK are used in combination 
to load all 16 control 
words 
(64 bits) in a single cycle. A 
MSH/LSH exchange like that shown in Table 3 is used to load the flip flops on a rising CRCLK clock edge. 
For example, data inputs 03-00 
go to the data bus and then to the CF group that selects the data outputs 
for 035-032. 
CRWRITE2-CRWRITEO select the control 
bank that 
is loaded (see Table 6). 


The 
CNTR15-CNTRO 
inputs 
can also be used to 
load the 
control 
banks. 
The 
bank 
is selected 
by 
CRWRITE2-CRWRITEO (see Table 6). Four control words per CRCLK cycle can be input to the CF groups 
(CFXX) that 
make up the bank. The CF groups loaded are selected 
by CRAOR1-CRAORO, as shown 
in 
Table 5. Four CRCLK cycles are needed to load an entire control 
bank. 


CF GROUPS 
lOADED 
BY 


CRAD1 
CRADO 
WE 
CRClK 
CONTROL 
(CNTRI I/O NUMBERS 


15-12 
11-8 
7-4 
3-0 


l 
l 
l 
S 
CF12 
CF8 
CF4 
CFO 


l 
H 
l 
S 
CF13 
CF9 
CF5 
CFl 


H 
l 
l 
S 
CF14 
CF10 
CF6 
CF2 


H 
H 
l 
S 
CF15 
CFll 
CF7 
CF3 


X 
X 
H 
X 
Inhibit write to flip-flops 


To read out the control 
settings, 
the same address signals can be used, except that no CRCLK signal is 
needed and OEC is pulled low. CREA02-CREAOO select the bank to be read; the format 
is the same as 
for CRWRITE2-CRWRITEO, shown 
in Table 4. 


Using the control 
I/0s 
to read the control bank settings 
can be valuable during debugging or diagnostics. 


Control settings are volatile and will be lost if the' ACT8841 A is powered off. An external program controlling 
switch 
operation 
may need to read the control 
bank settings 
so that it can save and restore the current 
switching 
configurations. 


test pins 


TP1-TPO test pins are provided for system testing. 
As Table 6 shows, 
these pins should be maintained 
high during normal operation. 
To force all outputs 
and I/0s 
low, low signals are placed on TP1-TPO and 


111 output enables (OE015-0EOO 
and OEC). To force all outputs 
and I/0s 
high, TP1 and all output 
enables 


are pulled low, and TPO is driven high. When TPO is left low and a high signal is placed on TP1, all outputs 
on the' ACT8841 A are placed in a high-impedance 
state, isolating the chip from the rest of the system. 


OED15- 
OEe 
TP1 
TPO 
OEDO 
RESULT 


l 
l 
l 
l 
All outputs and I/Os forced low 


l 
H 
l 
l 
All outputs and liDs forced high 


H 
l 
X 
X 
All outputs placed in a high-impedance state 


H 
H 
X 
X 
Normal operation Idefault state) 
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examples 


Most' ACT8841 A switch configurations 
are straightforward 
to program, involving few control signals and 
procedures to set up the control words in the banks of flip-flops. Control signals and procedures for loading 
and using control 
words are shown in the following 
examples. 


broadcasting 
a nibble 


Any of the 16 data input nibbles can be broadcast to the other 15 data nibbles for output. 
For ease of 
presentation, 
input nibble 063-060 
is used in this example. Example 1 presents the microcode sequence 
for loading flip-flop 
bank 0 and executing 
the nibble broadcast. 


The 
low 
signal on CRSRCE selects 
CNTR15-CNTRO 
as the 
input 
source, 
and the 
low 
signals 
on 
CRWRITE2-CRWRITEO select flip-flop 
bank 0 as the destination. 
Table 3 shows that to select data on 
063-060 
as the output nibble, the four bits in the control word CFN3-CFNO must be high; therefore the 
CNTR15-CNTRO inputs are coded high. The four microcode instructions shown in Example 1 load the same 
control 
word from CNTR15-CNTRO into all 16 CF groups of bank O. 


Once the control 
flip-flops 
have been loaded, the switch 
can be used to broadcast 
nibble 063-060 
as 
programmed. 
The microcode 
instruction 
to execute the broadcast 
is shown 
as the last instruction 
in 
Example 1. WE is held high and the data to be broadcast is input on 063-060. 
The high signal on SELOMS 
selects a real-time data input for the broadcast. 
MSCLK and LSCLK (not shown) can be used to load the 
input registers if the input nibble is to be retained. No register clock signals are needed if the input data 
is not being stored. 


The banks of control flip-flops 
not selected as a control 
source can be loaded with 
new control 
words 
or read out on CNTR15-CNTRO while the switch 
is operating. 
For example, the MSH data inputs can be 
used to load flip-flop 
bank 1 of the LSH while bank 0 of the LSH is controlling 
data 110. 
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" 
0 
'" 
-; 
~- 
i;j z 
'" ~ 
0 
X 
~ ;or;;i 
8 


~~ 
0~r; 


~~ 


!" 
;;1 
x 
l> 
'"~ 
'".., 
g: 


INST. 
CNTR 
I/O NUMBERS 
WE 
0ED16-0ED0 
- 
CRSRCE 
CRWR1TE2 
CRWRITEI 
CRWRlTEO 
CRADRI 
CRADRO 
CRSEl3 
CRSEl2 
CRSE1.1 
CRSELO 
sa.DMS 
SELDUI 
OEC 
CRCLK 
NO. 
1$·12 
11-8 
7..- 
3-0 


1 
0 
0 
0 
0 
0 
0 
1111 
1111 
1111 
1111 
X 
X 
X 
X 
0 
X 
X 
XXXX 
XXXX 
XXXX 
XXXX 
1 
S 


2 
0 
0 
0 
0 
0 
1 
1111 
1111 
1111 
1111 
X 
X 
X 
X 
0 
X 
X 
XXXX 
XXXX 
XXXX 
XXXX 
1 
S 


3 
0 
0 
0 
0 
1 
0 
1111 
1111 
1111 
1111 
X 
X 
X 
X 
0 
X 
X 
XXXX 
XXXX 
XXXX 
XXXX 
1 
S 


4 
0 
0 
0 
0 
1 
1 
1111 
1111 
1111 
1111 
X 
X 
X 
X 
0 
X 
X 
XXXX 
XXXX 
XXXX 
XXXX 
1 
S 


5 
X 
X 
X 
X 
X 
X 
XXXX 
XXXX 
XXXX 
XXXX 
0 
0 
0 
0 
1 
1 
X 
1000סס ooסס ooסס oo 
1 
None 


INST. 
NO. 
COMMENT 


1 
loads 
CF12. 
CF8. 
CF4. 
CFO of bank 
0 


2 
loads CF13. CF9. CF5. CFl of bank 0 


3 
Loads 
CF14, 
CF10. 
CF6. 
CF2 
of bank 
0 


4 
Loeds 
CF15. 
CF11. 
CF7. 
CF3 
of bank 
0 


5 
Selects 
bank 
0 for 
switching 
control 


Selects real-time data inputs 


INST. 
CNTR 
110 NUMBERS 
WE 
0ED15-0EDO 
OEC 
CRSRCE 
CRWRITE2 
CRWRITEl 
CRWRITEO 
CRADRl 
CRADRO 
CRSEL3 
CRSEL2 
CQSEll 
CRSElO 
SELDMS 
SELDLS 
CRCLK 
NO. 
15·12 
11-8 
7-4 
3-0 
, 
0 
1 
1 
1 
0 
0 
0100סס oo 
1100 
1000 
X 
X 
X 
X 
0 
X 
X 
xxxx 
xxxx 
xxxx 
XXXX 
1 
.f 


2 
0 
, 
1 
1 
0 
1 
0101 
0001 
1'01 
100' 
X 
X 
X 
X 
0 
X 
X 
XXXX xxxx 
XXXX 
XXXX 
, 
f 


3 
0 
1 
, 
1 
1 
0 
0111 
0011 
1111 
1011 
X 
X 
X 
X 
0 
X 
X 
xxxx 
xxxx 
xxxx 
XXXX 
1 
S 


4 
0 
1 
1 
1 
, 
, 
0"1 
0011 
1111 
1011 
X 
X 
X 
X 
0 
X 
X 
XXXX 
XXXX 
XXXx 
XXX 
X 
, 
f 


5 
X 
X 
X 
X 
X 
X 
xxx X xxxx 
xxxx 
xxxx 
0 
, 
1 
, 
, 
0 
0 
0000סס ooסס oo 
0000 
, 
None 


INST. 
NO. 
COMMENT 
, 
loads 
CFl 
2. 
CF8, 
CF4, 
CFO of bank 
7 


2 
loads 
CF13. 
CF9, 
CF5, 
CFl 
01 bank 
7 


3 
loads 
CF14, 
CF10. 
CF6. 
CF2 
of bank 
7 


4 
Loads 
CF15. 
CF11. 
CF7, 
CF3 of bank 
7 


5 
Selects 
bank 7 for switching 
control 


Selects 
registered 
data inputs 
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programming an MSH/LSH exchange 


A second, more complicated 
example involves programming 
the switch 
to swap corresponding 
nibbles 
between the MSH and the LSH (first nibble in the LSH for first nibble in the MSH, and so on). This swap 
can be implemented 
using the hard-wired logic circuit selected when CRSEL3 is high and CRSELO is low. 


Programming this swap without 
using the MSH/LSH exchange logic requires loading a different 
control 
word into each mux logic block. This is described below for purposes of illustration. 


Each nibble in one half, either LSH or MSH, selects as output the registered data from the corresponding 
nibble in the other half. The registered data from 035-032 
is to be output on 03-00, 
the registered data 
from 03-00 
is output on 035-032, 
and so on for the remaining nibbles. As shown in Table 2, the flip-flops 
for 03-00 
have to be set to 1000 and the 035-032 
inputs must be low. The CF groups and control words 
involved in this switching 
pattern are listed in Table 7. 


CNTRINPUTS 
CONTROL 
CF 
TO LOAD 
WORD 
RESULTS 
GROUP 
FLIP-FLOPS 
LOADED 


CF15 
0111 
031-028 
- 
063-060 


CF14 
CNTR15- 
0110 
027-024 
- 
059-056 


CF13 
CNTR12 
0101 
023-020 
- 
055-052 


CF12 
0100 
019-016 
- 
051-048 


CF11 
0011 
015-012 
- 
047-044 


CF10 
CNTR11- 
0010 
011-08 
- 
043-040 


CF9 
CNTR8 
0001 
07-04 
- 
039-036 


CF8 
0000 
03-00 
- 
035-032 
CF7 
1111 
063-060 
- 
031-028 


CF6 
CNTR7- 
1110 
059-056 
- 
027-024 


CF5 
CNTR4 
1101 
055-052 
- 
023-020 


CF4 
1100 
051-048 
- 
019-016 


CF3 
1011 
047-044 
- 
015-012 


CF2 
CNTR3- 
1010 
043-040 
- 
011-08 


CF1 
CNTRO 
1001 
039-036 
- 
07-04 


CFO 
1000 
035-032 
- 
03-00 


With this list of control 
words and the signals in Table 5, the 16-bit control 
inputs on CNTR15-CNTRO 
can be arranged to load the control flip-flops 
in four cycles. Example 2 shows the microcode instructions 
for loading the control 
words and executing 
the exchange. 


In Example 2, bank 7 of flip-flops is being programmed. Bank 7 is selected by taking CRWRITE2-CRWRITEO 
high and leaving CRSRCE low (see Table 4) when the control words are loaded on CNTR15-CNTRO. With 
WE held low, the CRCLK is used to load the four sets of control words. 
Once the flip-flops 
are loaded, 
data can be input on 063-00 
and the programmed 
pattern 
of output 
selection 
can be executed. 
A 
microinstruction 
to select registered data inputs and bank 7 as the control 
source is shown as the last 
instruction 
in Example 2. The data must be clocked into the input registers, using LSCLK and MSCLK, 
before the last instruction 
is executed. 
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The control 
flip-flops 
could also have been loaded from the data input nibbles in one CRCLK cycle. 
Input 
nibbles from one half are mapped onto the control 
flip-flops 
of the other half. All control 
words to set up 
a switching 
pattern 
should 
be loaded 
before 
the bank of flip-flops 
is selected 
as control 
source. 
The 
microcode 
instructions 
to load bank 1 with the 16 control 
words in one cycle are presented 
in Example 3. 


Example 3. Loading the MSH/LSH 
Exchange from 
Data Inputs 


CRWRITE2 
CRWRITE1 
CRWRITEO 
WE 
SELDMS 
SELDLS 
0ED15-0ED0 
o 
0 
1 
0 
1 
1 
1111111111111111 


These control nibbles may be loaded from the input as a 64-bit real-time input word or as two 32-bit words 
stored previously. 
To use stored control 
words, 
MSCLK and LSCLK are used to load the LSH and MSH 
input registers 
with the correct 
sequence of control 
nibbles. Whenever 
the flip-flops 
are loaded from the 
data inputs, all 64 bits of control 
data must be present when the CRCLK is used so that all control 
nibbles 
in a program 
are loaded simultaneously. 
Example 4 presents the three microcode 
instructions 
to load the 
MSH and LSH input 
registers 
and then to pass the registered 
data to flip-flop 
bank 2. 


INST. 
WE 
0ED15- 
CRSRCE CRWRITE2 CRWRITE1 CRWRITEO 
SELDMS SELDLS oroo 
CRCLK MSCLK 
LSCLK COMMENTS 
NO. 


1 
X 
X 
X 
X 
1 
X 
X 
1 
None 
S 
None 
Load inputs 


063-032 


2 
X 
X 
S 
Load inputs 
X 
X 
1 
X 
X 
1 
None 
None 
031-00 


Load control 
3 
1 
0 
1 
0 
0 
0 
0 
1 
S 
None 
None 
bank 2 


The control 
words 
in a program 
can also be read back from the flip-flops 
using the CNTR outputs. 
Four 
instructions 
are necessary 
to read the 64 bits in a bank of flip-flops 
out on CNTR15-CNTRO. 
WE is held 
high and OEC is taken low. 
No CRCLK signal is required. 
CREAD2-CREADO 
select 
bank 2 of flip-flops, 
and CRADR1-CRADRO 
select in sequence 
the four addresses of the 16-bit 
words 
to be read out on the 
CNTR outputs. 
Example 
5 shows 
the four microcode 
instructions. 


INST. 
~ 
- 
CNTR 110 NUMBERS 
CREAD2 CREAD1 CREADO 
CRADR1 CRADRO WE 
COMMENT 
NO. 
15-12 
11-8 
7-4 
3-0 


1 
0 
1 
0 
0 
0 
0 
1 
0100 
0000 
1100 
1000 
Read CF12, CF8, CF4, CFO 


2 
0 
1 
0 
0 
0 
1 
1 
0101 
0001 
1101 
1001 
Read CF13, CF9, CF5, CFl 


3 
0 
1 
0 
0 
1 
0 
1 
0110 
0010 
1110 
1010 
Read CF14. CF10, CF6, CF2 
4 
0 
1 
0 
0 
1 
1 
1 
0111 
0011 
1111 
1011 
Read CF15, CFll, 
CF7, CF3 
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absolute maximum ratings over operating free-air temperature range (unless otherwise noted) t 


Supply voltage, 
VCC (see Note 1) 
- 0.5 V to 6 V 
Input clamp current, 
11K (VI < 0 or VI > VCC) 
. . . . 
. . . . . . . . . . . . . 
± 20 mA 
Output 
clamp current, 
10K 
(VO < 0 or Vo > VCC) 
± 50 mA 


Continuous 
output 
current, 
10 (VO = 0 to VCC) . . . .. 
.. 
. .. 
. 
.. 
±50 
mA 


Continuous 
current through 
VCC or GND pins. 
. . . . . . . . . . . . . . . . . . . 
± 100 mA 
Operating free-air temperature 
range . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
ooC to 700C 
Storage temperature 
range 
- 65 °C to 150 °C 


tStresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" 
is not implied. Exposure to absolute-maximum·rated conditions for extended periods may affect device reliability. 


NOTE 1: All voltage levels are with respect to GND. 


PARAMETER 
MIN 
NOM 
MAX 
UNIT 


VCC 
Supply voltage 
4.75 
5.0 
5.25 
V 


VIH 
High-level input voltage 
2 
VCC 
V 


VIL 
Low-level input voltage 
0 
0.8 
V 


10H 
High-level output current 
-8 
mA 


10L 
Low-level output current 
8 
mA 


VI 
Input voltage 
0 
VCC 
V 


Vo 
Output voltage 
0 
VCC 
V 


dtldv 
Input transition rise or fall rate 
0 
15 
nsiV 


TA 
Operating free-air temperature 
0 
70 
ac 


electrical characteristics over recommended operating free-air temperature range (unless otherwise 
noted) 


PARAMETER 
TEST CONDITIONS 
VCC 
MIN 
TYP' 
MAX 
UNIT 


4.75 V 
4.6 


10H = -20 
p.A 
5.25 V 
5.1 


VOH 
V 
4.75 V 
3.85 
3.95 


10H = -8 
mA 
5.25 V 
4.60 
4.70 


10L = 20 p.A 
4.75 V 
0.1 


5.25 V 
0.1 
VOL 
V 
4.75 V 
0.32 
0.45 


10L = 8 mA 
5.25 V 
0.32 
0.45 


lit 
V, - 
VCC or 0 
5.25 V 
±0.1 
±1 
p.A 


ICCQ 
VI = VCC or 0 
5.25 V 
100 
200 
p.A 


CI 
VI = VCC or 0 
5V 
10 
pF 


~ICC§ 
One input at 3.4 V, 


5.25 V 
1 
mA 
other inputs at 0 or VCC 


tFor 1/0 ports, the parameters 10ZH and 10ZL include the offstate output curent. 
§This is the increase in supply current for each input that is at one of the specified TTL voltage levels rather than 0 or VCC' 
'AII typical values are at VCC = 5 V, TA = 25 ac. 
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switching characteristics over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise 
noted) t 


PARAMETER 
FROM 
TO 
MIN 
TYP* 
MAX 
UNIT 


Data in 
7 
14 


MSCLK, 
LSCLK 
10 
18 


SELDMS, 
SELDLS 
Data out 
9 
15 


CRCLK 
12 
19 


tpd 
CRSEL3-CRSELO 
12 
19 
ns 


CREAD2-CREADO 
10 
18 


CRCLK 
CNTRn 
10 
18 


CRAD1,CRADO 
8 
16 


TP1, TPO 
All outputs 
10 
19 


TP1, TPO 
All outputs 
10 
15 


ten 
OED 
Data out 
7 
12 
ns 


DEC 
CNTRn 
8 
14 


TP1, TPO 
All outputs 
10 
15 


tdis 
OED 
Data out 
5 
8 
ns 


DEC 
CNTRn 
6 
10 


tSee Parameter Measurement 
Information for load circuit and voltage waveforms. 


*AII typical 
values are at VCC 
= 5 V, TA 
= 25°C. 


timing requirements over recommended 
ranges of supply voltage and operating free-air temperature 


(unless otherwise 
noted) 


PARAMETER 
MIN 
MAX 
UNIT 


tw 
Pulse duration 
LSCLK, MSCLK, 
CRCLK high or low 
10 
ns 


Data 
7 


CNTRn 
7 


SELDMS, 
SELDLS 
9 


tsu 
Setup time before 
CRCLKt 
CRADR1,CRADRO 
8 
ns 


CRSRCE, CRWRITE2-CRWRITEO 
9 


LSCLK, MSCLK 
10 


WE 
8 


tsu 
Setup time before 
LSCLKt or MSCLKt 
Data 
7 
ns 


Data 
0 


CNTRn 
0 


Hold time after CRCLKt 


SELDMS, 
SELDLS 
0 
th 
CRADR1, 
CRADRO 
0 
ns 


CRSRCE, CRWRITE 
0 


WE 
0 


th 
Hold time after LSCLKt or MSCLKt 
Data 
0 
ns 
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TIMING 
CLOADt 
IOL 
IOH 
VLOAD 


PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


tPZH 
0 


ten 
tpZL 
50 
8 
-8 
3 


tdls 
tpHZ 


tpLZ 
50 
8 
-8 
1.5 


ted 
50 
8 
-8 
* 
t CLOAD includes probes and test fixture capacitance. 
* VLOAD - VOL = 50 Q, where VOL = 0.6 V, IOL = 8 mA. 


IOL 


Timing 
..I 
\ 


(See N~~:~~ ----~4+-. 
~5 ~ 
__ 
,,- 


tsu~th 


Data 
~.7VI 
2.7V 
--- 


Input 
1.5 V 
vi 
1.5 V 
0.3 V 
0.3 


tr=+l 
i+- 
41 I+-tf 


VOLTAGE WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


----,- 
---- 
3V 


Input 
L 1.5 V 
""- 
1.5 V 


(See Note A) --Ii 
1\.----0 
V 


tpd 
14 
.1 
~ 
tpd 


I 1 
! 
1_- 
VOH 


In·Phase 
I 
1.5 V 
1 
'i. 
1.5 V 
Output 
. 
\.......:. 
--1-- 
1 
VOL 


I~~, 
~tpd 


tpd~ 
1 
{ 
!C 
VOH 


\. 
1.5V 
T 
1.5V 
.•.•.----'.- 
- 
- 
VOL 


Out·of·Phase 


Output 


~ 


--3V 


1.5V 
1.5V 
I 
I 
OV 


14----- tw ---+I 
I 
I 


~ 


I 
3V 


1.5 V 
1.5 V 
____ 
OV 


VOLTAGE WAVEFORMS 
PULSE DURATION 


Output 
Control 
(Iow·level 
enabling) 


Waveform 
1 
(See Note B) 


1.5 V 


~---- 
OV 


tpZL ---.J 
~ 
1 
I 
I 
tPLZ-+j 
I+- 


__ 
I.•.•~I 
U 
3V 
I \I 
I 
1.••.•---1.5V 
I \ 
1.5V 
I L~L~~VVOL 
I 
tpHZ-+I 
I+- 


tpZH -+I 
1+- 
I 
...i 
I 


~~OH 


____ 
.-~~-----. 
~~.5V 


Waveform 
2 
(See Note C) 


Notes: 
A. Phase relationships between waveforms were chosen arbitrarily. All input pulses are supplied by pulse generators having the following 


characteristics: 
PRR = 1 MHz, Zo = 50 Q, tr " 6 ns, tf" 
6 ns. 


B. Waveform 
1 is for an output with internal conditions such that the output is low except when disabled by the output control. 


C. Waveform 2 is for an output with internal conditions such that the output is high except when disabled by the output control. For tpLZ 
and tpHZ, VOL and VOH are specified values. 


TEXAS 
." 
INSTRUMENTS 


SN74ACT8841A 
DIGITAL CROSSBAR SWITCH 


The' ACT8841 A and the bipolar SN74AS8840 
share the same architecture. 
Microcode 
for the' AS8840 
can be run on the' ACT8841 A if the additional 
control inputs to the' ACT8841 A are properly terminated. 
However, 
because the' ACT8841 A is a CMOS device with six additional 
control inputs, the' AS8840 
and 
the' 
ACT8841 A are not socket-compatible 
and cannot 
be used interchangably. 


The SN74AS8840 
and the SN74ACT8841 A digital crossbar switches essentially perform the same function. 


The' AS8840 
and the' ACT8841 A are based on the same 16-port 
architecture, 
differing 
in the number 
of control 
registers, 
power 
consumption, 
and pin-out. 


One difference 
is in the number of programmable 
control flip-flop 
banks available to configure 
the switch. 


The 'AS8840 
has two 
programmable 
control 
banks, while 
the 'ACT8841 
A has eight. 
Both have two 
selectable 
hard-wired 
switching 
configurations. 


The increased 
number of control 
banks in the' 
ACT8841 A require six additional 
pins not found 
on the 


'AS8840. 
These are: CRWRITE2, 
CRWRITE1, CREAD2, CREAD1, CRSEL3, and CRSEL2. CREAD and 
CRWRITE on the' AS8840 
become CREADO and CRWRITEOon the' ACT8841 A. On the' AS8840, 
CRSEL1 
selects the hardwired 
control functions 
when high. This function 
is performed 
by the CRSEL3 signal on 
the' 
ACT8841 A. Therefore, 
CRSEL2 and CRSEL1 are actually 
the added signals. 


The' ACT8841 A is a low-power 
CMOS device requiring only 5-V power. 
Because of its STL internal logic 
and TTL I/Os, the 'AS8840 
requires both 2-V and 5-V power. 


Both the' AS8840 
and the' ACT8841 A are in 156-pin grid-array packages; however, 
the two devices are 
not pin-for-pin 
compatible. 
Control signals were added to the' ACT8841 A and the 2-V VCC pins of the 


'AS8840 
were assigned other functions 
in the' 
ACT8841 A. 


changing 'AS8840 
microcode to .ACT8841 A microcode 


Since only six signals have been added to the 'ACT8841 
A, changing 
existing 
'AS8840 
microcode 
to 


•ACT8841 A microcode is straight forward. CRSEL3 on the' ACT8841 A is functionally 
equivalent to CRSEL1 
on the' AS8840. 
CREAD2, CREAD1, CRWRITE2, CRWRITE1, CRSEL2, and CRSEL1 bits must be added. 
These can always 
be 0 if no additional 
control 
banks are needed. Additional 
control 
configurations 
can 
be stored by programming 
these bits. 


All other signals in the' AS8840 
microcode 
remain the same when converting 
to 'ACT8841 
A microcode. 
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SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


• 
Performs Floating-Point 
and Integer Add, 
Subtract, 
Multiply, 
Divide. Square Root. and 
Compare 


• 
Multiplier 
and ALU May Operate in Parallel 


• 
Performs Logical Operations 
and Logical 
Shifts 


• 
64-Bit IEEE Divide in 11 Cycles, 64-Bit 
Suare Root in 14 Cycles 


• 
30-ns SN74ACT8847-30 
and 40-ns 
SN74ACT8847-40 
Pipelined Performance 
for 66 MFLOPS and 50 MFLOPS. 
Respectively 


• 
Low-Power 
0.8-",m 
EPIC'· CMOS Process 


• 
Meets IEEE Standard for Single- and Double- 
Precision Formats 


description 


The SN74ACT8847 
is a high-speed, double-precision floating-point 
and integer processor. It performs high- 
accuracy, 
scientific 
computations 
as part of a customized 
host processor or as a powerful 
stand-alone 
device. Its advanced math processing capabilities 
allow the chip to accelerate the performance 
of both 
CISC- and RISC-based systems. 


High-end computer systems, such as graphics workstations, 
mini-computers and 32-bit personal computers, 
can utilize the single-chip' 
ACT884 7 for both floating-point 
and integer functions. 


The SN74ACT8847 
combines a multiplier and an arithmetic-logic 
unit in a single microprogrammable 
VLSI 
device. The 'ACT8847 
is implemented in Texas Instruments O.8-",m CMOS technology 
to offer high speed 
and low 
power 
consumption 
with 
exceptional 
flexibility 
and functional 
integration. 
The FPU can be 
microprogrammed 
to operate in multiple 
modes to support 
a variety 
of floating-point 
applications. 


The' ACT8847 
is fully compatible 
with the IEEE Std 754-1985 
for binary floating-point 
arithmetic. 
This 
FPU performs 
both single- and double-precision 
operations, 
integer operations, 
logical operations, 
and 
division 
and square root operations 
(as single microinstructions). 


understanding 
the 
•ACT884 
7 floating-point 
unit 


To support floating-point 
processing in IEEEformat, 
the' ACT8847 
may be configured 
for either single- 
or double-precision 
operation. Instruction 
inputs can be used to select three modes of operation, 
including 
independent 
ALU operations, 
independent 
multiplier 
operations, 
or simultaneous 
ALU and multiplier 
operations. 


Three levels of internal data registers are available. The device can be used in pipelined mode (all registers 
enabled) or in other available register configurations. 
An instruction 
register, a 64-bit 
constant 
register, 
and a status 
register are also provided. 


Each FPU can handle three types of data input formats. 
The ALU accepts data operands in integer format 
or IEEEfloating-point 
format. 
A third type of operand. denormalized numbers, can also be processed after 
the ALU has converted them to "wrapped" 
numbers. which are explained in detail in a later section. The 


'ACT8847 
multiplier 
operates 
on normalized 
floating-point 
numbers, 
wrapped 
numbers, 
and integer 
operands. 


microprogramming 
the 
•ACT884 
7 


The' ACT884 7 is a fully microprogrammable 
device. Each FPU operation is specified by a microinstruction 
or sequence of microinstructions 
which set up the control inputs of the FPU so that the desired operation 
is performed. 


PRDDUCTIDll 
DATA documonts cont.in 
inform.tion 
current •• of public.tion 
dlte. Products conform to 
.pacification. 
p.r tho term. 
of T••• s Instrumonts 
standard warranty. Production processing does not 
nac•••• rily includo tosting 
of oil p.r.moters. 
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SN74ACT8847 
64-BIT 
flOATING-POINT 
UNIT 


pin descriptions 


Pin descriptions 
and grid assignment for the' ACT884 7 are given on the following 
pages. The pin at location 


A 1 has been omitted 
for indexing 
purposes. 


207 PIN ... 
GA PACKAGE 


(TOP VIEW) 


2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 


A 
• • • • • • • • • • • • • • • • 


B • • • • • • • • • • • • • • • • • 


C • • • • • • • • • • • • • • • • • 


D • • • • • • • • • • • • • • • • • 


E 
• • • • 
• • • • 


F • • • • 
• • • • 


G • • • • 
• • • • 


H • • • • 
• • • • 


J 
• • • • 
• • • • 


K 
• • • • 
• • • • 


L • • • • 
• • • • 


M • • • • 
• • • • 


N • • • • 
• • • • 


p • • • • • • • • • • • • • • • • • 


R • • • • • • • • • • • • • • • • • 


S • • • • • • • • • • • • • • • • • 


T • • • • • • • • • • • • • • • • • 
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SN74ACT8847 


64-BIT 
FLOATING-POINT 
UNIT 


PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. NAME 


A1 
missing 
C2 
YO 
E3 
FAST 
J15 
FLOWC 
P1 
ENRC 
S1 
NC 
A2 
INF 
C3 
Y3 
E4 
GND 
J16 
SRCC 
P2 
PIPESO S2 
PBO 


A3 
Y5 
C4 
Y6 
E14 
GND 
J17 
BYTEP 
P3 
RESET 
S3 
DBO 


A4 
YB 
C5 
Y9 
E15 
AGTB 
Kl 
SELOP3 
P4 
PB1 
S4 
DB4 


A5 
Y11 
C6 
Y12 
E16 
AEQB 
K2 
SELOP4 
P5 
DB1 
S5 
DB11 
A6 
Y14 
C7 
Y15 
E17 
MSERR 
K3 
SELOP5 
P6 
DB5 
S6 
DB12 


A7 
Y17 
C8 
Y18 
F1 
15 
K4 
GND 
P7 
DB9 
S7 
DB15 
A8 
Y20 
C9 
Y23 
F2 
13 
K14 
GND 
P8 
DB16 
S8 
DB19 
A9 
Y21 
C10 
Y26 
F3 
RNDO 
K15 
PAl 
P9 
DB21 
S9 
DB23 


A10 
Y24 
Cl1 
Y30 
F4 
GND 
K16 
PA2 
P10 
DB28 
S10 
DB26 


All 
Y27 
C12 
PY1 
F14 
GND 
K17 
PA3 
P11 
DAO 
S11 
DB30 


A 12 Y29 
C13 
UNDER 
F15 
PERRA 
L1 
SELOP6 
P12 
DA4 
S12 
DA2 


A13 
PYO 
C14 
INEX 
F16 
OEY 
L2 
SELOP7 
P13 
DA8 
S13 
DA6 


A14 
PY3 
C15 
DENIN 
F17 
OES 
L3 
CLK 
P14 
DA12 
S14 
DA10 


A151VAL 
C16 
SRCEX 
G1 
17 
L4 
VCC 
P15 
DA19 
S15 
DA14 


A16 
NEG 
C17 
CHEX 
G2 
16 
L14 
GND 
P16 
DA22 
S16 
DA15 


A17 
NC 
D1 
11 
G3 
14 
L15 
DA30 
P17 
DA23 
S17 
DA17 


B1 
ED 
D2 
RND1 
G4 
VCC 
L16 
DA3l 
R1 
PIPES1 Tl 
NC 
B2 
Y2 
D3 
Yl 
G14 
VCC 
L17 
PAO 
R2 
HALT 
T2 
PB3 


B3 
Y4 
D4 
GND 
G15 
OEC 
M1 
ENRB 
R3 
PB2 
T3 
DB3 


B4 
Y7 
D5 
VCC 
G16 
SELMS/LS 
M2 
ENRA 
R4 
DB2 
T4 
DB7 


B5 
Y10 
D6 
GND 
G17 
TESTl 
M3 
CLKC 
R5 
DB6 
T5 
DB8 


B6 
Y13 
D7 
GND 
H1 
110 
M4 
GND 
R6 
DB10 
T6 
DB13 


B7 
Y16 
DB 
VCC 
H2 
19 
M14 
VCC 
R7 
DB14 
T7 
DB17 


B8 
Y19 
D9 
GND 
H3 
18 
M15 
DA27 
R8 
DB18 
T8 
DB20 


B9 
Y22 
D10 
GND 
H4 
GND 
M16 
DA28 
R9 
DB22 
T9 
DB24 


B10 
Y25 
D1l 
VCC 
H14 
GND 
M17 
DA29 
R10 
DB27 
TlO 
DB25 


B11 
Y28 
D12 
GND 
H15 
TESTO 
N1 
CONFIGO 
Rl1 
DB31 
Tll 
DB29 
B12 
Y31 
D13 
GND 
H16 
SELSTl 
N2 
CONFIG1 
R12 
DA3 
Tl2 
DA1 
B13 
PY2 
D14 
VCC 
H17 
SELSTO 
N3 
CLKMODE 
R13 
DA7 
Tl3 
DA5 
B14 
OVER 
D15 
STEX1 
J1 
SELOP2 
N4 
PIPES2 
R14 
DA11 
Tl4 
DA9 


B15 
RNDCO 
D16 
STEXO 
J2 
SELOP1 
N14 
DA18 
R15 
DA16 
T15 
DA13 


B16 
DENORM 
D17 
UNORD 
J3 
SELOPO 
N15 
DA24 
R16 
DA20 
Tl6 
NC 


B17 
DIVBYO 
E1 
12 
J4 
VCC 
N16 
DA25 
R17 
DA21 
Tl7 
NC 


C1 
PERRB 
E2 
10 
J14 
VCC 
N17 
DA26 
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SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


CLKMODE 


BYTEP 


CONFIG1-0 


FAST 


RND1-0 


SRCC 


ENRC 


FLOWC 


SELOP7-0 


SELST1-0 


SELMS/LS 


CLK 


CLKC 


•• 
64-Bit 
Floating-Point 
Unit 


'ACTB847 


CLEARS 
STATES 
/1 
MASTER 
CLOCK 
(EXCEPT 
C REGISTER) 
& STATUS 


C REGISTER 
CLOCK 
STALLS 
OPERATION 
/1 


CLOCK 
EDGE 
INSTRUCTION, 
RA, 
& RB !FLOWTHROUGH 
d-=:J 
PARITY 
GENERATION 
REGISTERS 
EN 


DATA 
SOURCE 
ALU, 
MULTIPLIER" 


Lr::::. 
SUDDEN 
!UNDER- 
AND 
INSTRUCTION 
FLOWTHROUGH 
~ 
GRADUAL 
FLOW 
PIPELINE 
REGISTERS 
EN 


ROUNDING 
MODE 
SELECT 
STATUS, 
P, S'I 
F 
WTHROUGH 
AND 
INST 
PIPELINE 
LO 
~ 


~ 


MULTIPLIER 
REGISTERS 
EN 


ALU 
C REG 
DA 
DATA 
4-,- 
WRITE 
PARITY 
4 
BYPASS 
1/0 
DB 
DATA 
4' 
B 
, 
8'/ 
OPERAND 
SOURCE 
Y BUS 
, 


STATUS 
SOURCE 


MSHI 
I 
DA 
DATA 
L-t::". 
LSH 
Y BUS 
STATUS 
DB 
DATA 
PARITY 
MASTER/SLAVE 
2/ 
COMPARATOR 
/ 
0 
COMPARISON 


STATUS 


INSTRUCTION 
- 


EXCEPTION 
· 


10 
AND 
· 


OTHER 


LOAD 
RA 
REGISTER 
STATUS 


r--.. 
LOAD 
RB REGISTER 
· 
r--.. 
EXCEPTION 
& OTHER 
STATUS 
EN 
COMPARISON 
STATUS 
r-..... 


Y31-YO, 
PY3-PYO 
.., 
r 


0 
0 
• 
· 
I 
DATA > 
< 
> · 
· 
• 
• 
DATA 
· 
• 
• 
• 
· 
• 
31 
31 


0 
· 
• 
I 
DATA > 
• 
• 
• 
• 
31 


TP1-0 


10 
11 


12 
13 


14 
15 


16 


17 


18 


19 


110 


ENRA 


ENRB 


OES 


OEC 


OEY 


PA3-0 


PB3-0 


PY3-0 


PERRA 


PERRB 


MSERR 


UNORD 


AGTB 


AEOB 


ED 


DIVBYO 


IVAL 


INEX 


OVER' 


UNDER 


DENORM 


DENIN 


RNDCO 


SRCEX 


CHEX 


STEX1-0 


NEG 


INF 
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FLOATING-POINT 
UNIT 


110·10 


SELOP7 


SELOPO 


PtPES2 
PIPES1 


FAST 
AND1·RNDO 


----------+- 
Flowe 


-----+- HALT 


------4- 
BYTEP 


-----+-ClK 
-----+- 
PIPESO 


-- 
......•..ClKMooe 


-----+ RESEf 


~TP1-TPO 
10 
--r---=--+- 
Vec 


~GNO 


FROM 
INSTRUCTION 
- 
-- 
ptPELINE 


REGISTEA@ 


EO 


OIVBYP 
IVAl 
INEX 


OVER 


UNDER 


OENQAM 


DENIN 


RNOCO 
SAC EX 
eHU 


STEX 
'·$TEXO 


NED 
INF 
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SN74ACT8847 
54·BIT FLOATING-POINT 
UNIT 


PIN 
I/o/Zt 
DESCRIPTION 


NAME 
NO. 


DATA BUS SIGNALS 196 PINS) 


DAO 
P11 


DA1 
T12 


DA2 
S12 


DA3 
R12 


DA4 
P12 


DA5 
T13 


DA6 
513 


DA7 
A13 


DAB 
P13 


DA9 
T14 


DA10 
514 


DA11 
R14 


DA12 
P14 


DA13 
T15 


DA14 
515 


DA15 
516 


DA16 
R15 
I 
DA 32-bit input data bus. Data can be latched in a 64-bit temporary register or loaded directly into an input 


517 
register. 


DA17 


DA1B 
N14 


DA19 
P15 


DA20 
A16 


DA21 
A17 


DA22 
P16 


DA23 
P17 


DA24 
N15 


DA25 
N16 


DA26 
N17 


DA27 
M15 


DA2B 
M16 


DA29 
M17 


DA30 
L15 


DA31 
L16 


DBO 
53 


DB1 
P5 


DB2 
R4 


DB3 
T3 


DB4 
54 


DB5 
P6 


DB6 
R5 
I 
DB 32-bit input data bus. Data can be latched in a 64-bit temporary register or loaded directly into an input 


DB7 
T4 


register. 


DBB 
T5 


DB9 
P7 


DB10 
R6 


DB11 
55 


DB12 
56 
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SN74ACT8847 


64·BIT 
FLOATING-POINT 
UNIT 


PIN 
IIOIZ' 
DESCRIPTION 
NAME 
NO. 


DATA 
BUS SIGNALS (96 PINS) 


0813 
T6 
0814 
R7 
0815 
57 
0816 
P8 
0817 
17 
0818 
R8 
0819 
58 
0820 
T8 


0821 
P9 
0822 
R9 
I 
08 32-bitinputdatabus.Oatacan be latchedina 64-bittemporaryregister 
orloadeddirectly 
intoan input 


0823 
59 
register. 


0824 
T9 
0825 
T10 
0826 
510 
0827 
R10 
0828 
P10 
0829 
T11 
0830 
511 
0831 
Rl1 


YO 
C2 
Y1 
03 
Y2 
82 
Y3 
C3 
Y4 
83 
Y5 
A3 
Y6 
C4 
Y7 
84 
Y8 
A4 
Y9 
C5 
Y10 
85 
Y11 
A5 
Y12 
C6 


I/O/Z 
32-bitY outputdatabus 
Y13 
86 
Y14 
A6 
Y15 
C7 
Y16 
87 
Y17 
A7 
Y18 
C8 
Y19 
88 
Y20 
A8 
Y21 
A9 
Y22 
89 
Y23 
C9 
Y24 
A10 
Y25 
810 
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SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


PIN 
IIOllt 
DESCRIPTION 
NAME 
NO. 


DATA BUS SIGNALS (96 PINS) 


Y26 
Cl0 


Y27 
All 


Y28 
Bl1 
IIOll 
32-bit Y output data bus 
Y29 
A12 


Y30 
Cll 


Y31 
B12 


PARITY AND MASTERISLAVE SIGNALS (16 PINS) 


BYTEP 
J17 
I 
When high, selects parity generation for each byte of input (four parity bits for each bus). When low, 


selects parity generation for whole 32-bit input (one parity bit for each bus). Even parity is used. 


MSERR 
E17 
0 
MasterlSlave error output pin 


PAO 
L17 


PAl 
K15 
I 
Parity inputs for DA data 
PA2 
K16 


PA3 
K17 


PBO 
S2 


PBl 
P4 
I 
Parity inputs for DB data 
PB2 
R3 


PB3 
T2 


PERRA 
F15 
0 
DA data parity error output. When high, signals a byte or word has failed an even parity check. 


PERRB 
Cl 
0 
DB data parity error output. 
When high, signals a byte or word 
has failed an even parity 
check. 
PYO 
A13 


PYl 
C12 
IIOll 
Y port parity data 
PY2 
B13 


PY3 
A14 


CLOCK, CONTROL, AND INSTRUCTION SIGNALS (46 PINS) 


CLK 
L3 
I 
Master 
clock 
for 
all registers 
except 
C register 


CLKC 
M3 
I 
C register clock 


Selects whether temporary register loads only on rising clock edge (CLKMODE = L) or on falling edge 
CLKMODE 
N3 
I 
(CLKMODE = HI. 


CONFIGO 
Nl 
I 
Select data sources for RA and RB registers from DA bus, DB bus and temporary register 
CONFIGl 
N2 


ENRA 
M2 
I 
When high, enables loading of RA register on a rising clock edge if the RA register is not disabled (see PIPESO 
below). 


ENRB 
Ml 
I 
When high, enables loading of RBregister on a rising clock edge if the RBregister is not disabled (see PIPESO 
below). 


ENRC 
Pl 
I 
When low, enables write to C register when CLKC goes high. 


FAST 
E3 
I 
When low, selects gradual underflow 
(IEEE model). When high, selects sudden underflow, 
forcing all 


denormalized 
inputs 
and outputs 
to zero. 


When 
high, 
causes 
product 
or sum 
to bypass 
C register, 
so that 
product 
or sum 
appears 
on the 
C register 


FLOWC 
J15 
I 
output 
bus. 
Timing 
is similar 
to P register 
or S register 
feedback 
operands. 
C register 
remains 
unchanged. 


Product or sum may also be simultaneously fed back in usual manner (not through C register!. 


HALT 
R2 
I 
Stalls 
operation 
without 
altering 
contents 
of instruction 
or data 
registers 
(except 
the 
CREG, 
which 
has 
a 


separate 
write 
enable). 
Active 
low. 


10 
E2 


I 
Instruction 
inputs 
11 
01 
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SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


PIN 
I/O/Zt 
DESCRIPTION 


NAME 
NO. 
CLOCK, CONTROL, AND INSTRUCTION SIGNALS (46 PINS) 


12 
E1 


13 
F2 


14 
G3 


15 
F1 


16 
G2 
I 
Instruction 
inputs 


17 
G1 


18 
H3 


19 
H2 


110 
H1 


OEC 
G15 
I 
Comparison 
status 
output 
enable. 
Active 
low. 


OES 
F17 
I 
Exception status and other status output enable. Active 
low. 


OEY 
F16 
I 
Y bus output enable. Active low. 


PIPESO 
P2 
I 
When low, enables instruction register and, depending on setting of ENRAand ENRB,the RA and RB input 


registers. When high, puts instruction, 
RA and RB registers in flowthrough 
mode. 


PIPES1 
R1 
I 
When low, enables pipeline registers in ALU and multiplier. When high, puts pipeline registers in flowthrough 


mode. 


PIPES2 
N4 
I 
When low, enables status register, product (PI and sum (SI registers. When high, puts status register, P 


and S registers in flowthrough 
mode. 


RESET 
P3 
I 
Clears 
internal 
states, 
status, 
and exception 
disable 
register. 
Contents 
of internal 
pipeline 
registers 
are lost. 


Does 
not 
affect 
other 
data 
registers. 
Active 
low. 


RNDO 
F3 
i 
Rounding mode control pins. Select four IEEErounding modes. 
RND1 
02 


RNDCO 
B15 
I/Oll 
When 
high. 
indicates 
the 
mantissa 
of a number 
has 
been 
increased 
in magnitude 
by 
rounding. 


SELOPO 
J3 


SELOP1 
J2 


SELOP2 
J1 


SELOP3 
K1 
I 
Select operand sources for multiplier and ALU 
SELOP4 
K2 


SELOP5 
K3 


SELOP6 
L1 


SELOP7 
L2 


SELSTO 
H17 


I 
Select 
status 
source 
during 
chained 
operation 
SELSTl 
H16 


SELMS/LS 
G16 
I 
When low, selects LSH of 64-bit result to be output on the Y bus. When high, selects MSH of 64-bit result. 


(No effect on single-precision operations.) 


SRCC 
J16 
I 
When low, selects ALU as data source for C register. When high, selects multiplier as data source for C 


register. 


TESTa 
H15 
Test pins. Tied high for normal operation. 
I 
TESTl 
G17 


STATUS SIGNALS (17 PINS) 


AEQB 
E16 
IIOll 
Comparison 
status 
or zero detect 
pin. When 
high, 
indicates 
that 
A and B operands 
are equal 
during 
a compare 


operation in the ALU. If not a compare, a high signal indicates a zero result on the Y bus. 


AGTB 
E15 
IIOll 
Comparison status pin. When high, indicates that A operand is greater than B operand. 


CHEX 
C17 
IIOll 
Status 
pin indicating 
an exception 
during 
a chained 
function. 
If 16 is low, 
indicates 
the 
multiplier 
is the source 


of an exception. If 16 is high, indicates the ALU is the source of an exception. 
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PIN 


I/O/z' 
DESCRIPTION 


NAME 
NO. 


STATUS SIGNALS (17 PINS) 


DENIN 
C15 
I/Oll 
Status pin indicating a denormal input to the multiplier. When DENIN goes high, the STEX pins indicate 


which port had the denormal input. 


Status pin indicating a denormal output from the ALU or a wrapped output from the multiplier. In FAST 
DENORM 
B16 
I/Oll 
mode, causes the result to go to zero when DENORM is high. 


DIVBYO 
B17 
I/Oll 
Status 
pin 
indicating 
an 
attempted 
operation 
involved 
dividing 
by zero 


ED 
B1 
I/Oll 
Exception detect status signal representing logical ORof all enabledexceptions in the exception disable register 


INEX 
C14 
I/Oll 
Status 
pin indicating 
an inexact 
output 


INF 
A2 
I/Oll 
Status 
pin. 
When 
high, 
indicates 
output 
value 
is infinity. 


IVAL 
A15 
I/Oll 
Status pin indicating that an invalid operation or a nonnumber (NaN) has been input to the multiplier or ALU. 


NEG 
A16 
I/Oll 
Status pin. When high, indicates result has negative sign. 


OVER 
B14 
I/Oll 
Status pin indicating that the result is greater the largest allowable value for specified format (exponent 


overflow). 


SRCEX 
C16 
I/Oll 
Status pin indicating source of exception, either ALU (SRCEX = LI or multiplier (SRCEX = HI. 


STEXO 
016 
Status pins indicating that a nonnumber (NaN) or denormal number has been input on A port (STEX1) or 
I/Oll 
STEX1 
B port (STEXO). 


UNDER 
C13 
I/Oll 
Status 
pin indicating 
that 
a result 
is inexact 
and 
less than 
minimum 
allowable 
value 
for 
format 
(exponent 


underflow). 


UNORD 
D17 
I/Oll 
Comparison 
status 
pin 
indicating 
that 
the 
two 
inputs 
are 
unordered 
because 
at 
least 
one 
of 
them 
is a 


nonnumber (NaN). 


SUPPLY AND NIC SIGNALS (33 PINS) 


VCC 
05 


VCC 
DB 


VCC 
D11 


VCC 
D14 


VCC 
G4 
I 
5-V supply voltage pins 


VCC 
G14 


VCC 
J4 


VCC 
J14 


VCC 
L4 


VCC 
M14 


GND 
D4 


GND 
D6 


GND 
D7 
GND 
D9 
GND 
Dl0 
GND 
D12 


GND 
D13 


GND 
E4 


GND 
E14 
I 
Ground pins. NOTE: All ground pins should be used and connected. 


GND 
F4 


GND 
F14 


GND 
H4 


GND 
H14 


GND 
K4 


GND 
K14 
GND 
L14 
GND 
M4 
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PIN 
I/O/Z' 
DESCRIPTION 
NAME 
NO. 


SUPPLY AND NIC SIGNALS (33 PINS) 


NC 
A17 


NC 
S1 


NC 
T1 
No internal 
connection. 
Pins should 
be left floating. 


NC 
T16 


NC 
T17 


introduction 


Designing with the' ACT884 7 floating-point 
unit (FPU) requires a thorough 
understanding 
of computer 
architectures, 
microprogramming, 
and IEEEfloating-point 
arithmetic, 
as well as a detailed knowledge of 
the' ACT884 7 itself. This introduction 
presents a brief overview of the' ACT884 7 and discusses a number 
of issues when designing and programming 
with this FPU. 


major architectural features 


The overall architecture 
for a floating-point 
system is determined by a combination 
of design factors. The 
principal consideration 
is the set of performance targets that the floating-point 
processor has to achieve, 
usually expressed in terms of clock cycle period, operating mode (vector or scalar), and operand precision 
(32 bit, 64 bit, or other). Of almost equal importance are design constraints of cost, complexity, chip count, 
power consumption, 
and requirements for interfacing 
to other processors. 


The architecture 
of the' ACT884 7 is optimized to satisfy several processing and interface requirements. 


The FPU has two 32-bit input buses, the DA and DB data buses, and one 32-bit output bus, the Y bus. 
This 3-port design provides much greater I/O bus bandwidth than can be achieved by a single-port device 
. (one 32-bit I/O bus). Two single-precision 
inputs can be simultaneously 
loaded on the input buses while 
a result is being output 
on the Y bus. 


Internally, 
the 'ACT884 7 FPU consists of two 
main functional 
blocks: the multiplier 
and the ALU (see 
Figure 1). Either the multiplier or the ALU can operate independently, 
or the two functional 
units can be 
used simultaneously 
in "chained" 
mode. When operating independently, 
each block of the FPU performs 
a separate set of arithmetic or logical functions. The multiplier supports multiplication, 
division, and square 
roots. The ALU supports addition, subtraction, 
format conversions, logical operations, and shifts. Integer 
division and integer square root require both the multiplier and the ALU; the final result comes from the ALU. 


In chained mode, a multiplier operation executes in parallel with an ALU operation. Possible examples include 
calculations of a sum of products (multiply and accumulate) or a product of sums (add and then multiply). 
The sum of products computation 
requires a total of four operands: two new inputs to be multiplied, the 
sum of previous products, 
and the current product to be added to the sum, as shown in Table 1. 


MULTIPLIER OPERATION 
ALU OPERATION 
A • B 
- 


C • D 
IA • BI + 0 


E • F 
IC • DJ + IA • BI 
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Because the' ACT884 7 has multiple internal data paths and data registers, this sum of products 
can be 
generated by simultaneous 
operations on new bus data and internal feedback, 
without 
the necessity of 
storing either the previous accumulation 
or the current product off chip. Data flow for the sum of products 
calculation 
is shown in Figure 2. 


Because the' ACT884 7 has multiple internal data paths and data registers, this sum of products 
can be 
generated by simultaneous 
operations on new bus data and internal feedback, 
without 
the necessity 
of 
storing either the previous accumulation 
or the current product off chip. Data flow for the sum of products 
calculation 
is shown in Figure 2. 


data flow in pipelined architectures 


Several levels of internal data registers are available to segment the internal data paths of the' ACT884 7. 
Enabling one or more registers divides the data paths so that data can be clocked into internal registers, 
instead of from an external source to an external destination. 
Enabling the input registers permits data 
and instruction 
inputs to be registered on chip. Also, the hardware division and square root operations 
which the' ACT884 7 performs 
require that the input registers be enabled. 


In the main data paths, three sets of internal registers are available in the' ACT884 7: input registers, pipeline 
registers in the multiplier and ALU logic blocks, and output registers to capture results from the multiplier 
and the ALU. When all three levels of data registers are enabled, the register-to-register 
delay inside the 
device is minimized. This is the fastest 
operating mode, and in this configuration 
the' ACT884 7 is said 
to be "fully 
pipelined." 
While one instruction 
is executing, 
the next instruction 
along with its associated 
operands may be input to device so that overlapped operations 
occur (see Figure 3). 


The selection of operating mode determines the latency from input to output, the number of clock cycles 
required for inputs to be processed and results to appear. For each register level enabled in the data path, 
one clock cycle is added to the latency from input to output. 


control architectures 
for high-speed microprogrammed 
architectures 


A separate control circuit is required to sequence the operation of the' ACT884 7. A sequencer function 
within the control circuit controls both the sequencer and FPU as determined by FPU status outputs. 
Either 
a standard microsequencer 
such as the SN74ACT8818A, 
or a custom controller 
such as a PLA or gate 
array can be used to control the FPU. Figure 4 shows an example block diagram for a PLA control circuit. 


If a standard microsequencer is used, execution addresses for routines stored in the microprogram memory 
are generated by the microsequencer. 
As its name implies, microprogram 
memory stores the sequences 
of microinstructions 
which control FPU execution. 
The' ACT884 7 can be programmed 
by generating all 
control 
bits in a given microinstruction 
to select a FPU operation. 
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INSTRUCTION 
INPUTS I 
A_+_B 
C_+_D 
E_+_F 
_ 


DATA 
INPUTS 
I~_A_, 
_B 
C_'_0 
E_,_F 
_ 


INPUT 
REGISTER I 
CONTENTS ~----~ 


ALU 
PIPELINE I 
REGISTER 
CONTENTS ~----------- 


OUTPUT 
SUM I 
REGISTER 
CONTENTS 
A_+_B 
C_+_D 
E_+_F __ 


PROGRAMMABLE 


LOGIC 
ARRAY 


(PLA) 


One possible control circuit for the' ACT884 7 consists of a microsequencer, microprogram memory, and 
one or more microinstruction 
registers, together with status logic as required to support a specific floating- 
point implementation, 
A control circuit without 
an instruction 
register is typically too slow for use with 
the' ACT884 7, At least one microinstruction 
register is used to hold the current instruction being executed 
by the FPU and sequencer (see Figure 5). 


Inclusion of the microinstruction 
register divides the critical path from the sequencer through the program 
memory to the FPU control inputs, permitting much faster execution times, However, when all the internal 
registers of the FPU are enabled, FPU operation may be fast enough to require a second register in the 
control circuit, 
In this case, a register on the output bus of the sequencer captures each microprogram 
address, and the microinstruction 
register captures each microinstruction 
(see Figure 6), 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


Introducing 
registers in the FPU data paths and the control circuit complicates 
I/O timing, 
status output 
timing, the status logic, and the microprogram 
for the FPU and the sequencer. These timing relationships 
affect 
branches, jumps to subroutine, 
and other operations 
depending on FPU status. 
Some of these 
programming 
issues are discussed below. 


microprogram 
control of an .ACT884 
7 FPU subsystem 


A microprogram 
to control the' ACT884 7 must take into account not only the FPU operation but also the 
sequencer operation, especially when the system is performing a branch on status or handling an exception. 


Several options are available for dealing with such exceptions. The' ACT884 7 can be programmed to discard 
operands in invalid formats 
and some exceptions 
caused by illegal operations. 
In general, though, 
the 
microprogram 
should be designed to handle a range of status 
results or exceptions. 
Hardware 
timing 
considerations 
such as pipeline delays in both control 
and data paths must be studied to minimize the 
difficulty 
of performing 
branches to status exception 
handlers. 


The 'ACT8847 
accepts 
either 
operands 
as normalized 
IEEE floating-point 
numbers, 
(ANSI/IEEE Std 
754-1985), 
unsigned 32-bit integers, or 2's complement 
integers. Floating-point 
operands may be either 
single precision (32 bits) or double precision (64 bits). 


IEEE formats 
for floating-point 
operands, both single and double precision, consist of three fields: sign, 
exponent, and fraction, 
in that order. The leftmost 
(most significant) 
bit is the sign bit. The exponent field 
is 8 bits long in single-precision 
operands and 11 bits long in double-precision 
operands. The fraction 
field 
is 23 bits in single precision and 52 bits in double precision. The value of the fraction 
contains a hidden 
bit, an implicit 
leading "1", 
as shown below: 


The representation 
of a normalized floating-point 
number is: 


(-l)S 
* 1.f * 2{e-bias) 


where the bias is either 127 for single-precision 
operands or 1023 for double-precision 
operands. 


The formats for single-precision 
and double-precision 
numbers are shown in Figures 7 and 8, respectively. 


Further details of IEEEformats and exceptions 
are provided in the IEEEStandard for Binary Floating-Point 
Arithmetic, 
ANSI/IEEE Std 754-1985. 
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~~-----~ 
s: sign of fraction 
e: 8-bit exponent 
biased by 127 
I: 
23-bit Iraction 


~------ 
s: sign of fraction 
e: 
11-bit exponent 
biased by 1023 
I: 
52-bit Iraction 


The' ACT884 7 also handles two other operand formats which permit operations with very small floating- 
point numbers. The ALU accepts denormalized floating-point 
numbers, that is, floating-point 
numbers so 


small that they could not be normalized. If these denormal operands are input to the multiplier, 
they will 


cause status exceptions. 
Denormals can be passed through the ALU to be "wrapped" 
and the wrapped 


operands can then be input to the multiplier. 


A denormalized input has the form of a floating-point 
number with a zero exponent, a nonzero mantissa, 
and a zero in the leftmost 
bit of the mantissa (hidden or implicit bit). Using single precision, a denorm is 


equal to: 


For double precision, a denorm is equal to: 


(- 
1)S * (2) - 1022 
* fraction 


A denormalized number results from decrementing the biased exponent field to zero before normalization 
is complete. 
Since a denormalized number cannot be input to the multiplier, 
it must first be converted 


to a wrapped number by the ALU. A wrapped number is a number created by normalizing a denormalized 
number's fraction field and subtracting from the exponent the number of shift positions (minus one) required 
to do so. The exponent is encoded as a two's 
complement 
negative number. When the mantissa of the 


denormal is normalized by shifting it left, the exponent field decrements from all zeros (wraps past zero) 
to a negative two's 
complement 
number (except in the case of 0.1 XXX ... , where the exponent 
is not 


decremented). 


Floating-point 
formats 
handled by the 'ACT884 7 are presented in Table 2. 
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TYPE OF 
EXPONENT (e) 
FRACTION (f) 
HIDDEN 
VALUE OF NUMBER REPRESENTED 


OPERAND 
SP (HEX) DP (HEX) 
(BINARY) 
BIT 
SP (DECIMAL)t 
DP (DECIMAL)t 


Normalized 
FE 
7FE 
All 1's 
1 
(-1)s 
(2127) 12-2-23) 
1-1)s 
(21023) 
(2-2-52) 


Number (max) 


Normalized 
01 
001 
All D's 
1 
(-1)s 
(2 - 126) (1) 
l-l)S 
12-1022) 
(1) 
Number (min) 


Denormalized 
00 
000 
Aill's 
0 
(l-)S 
(2-126) 
11-2-23) 
(-l)S 
(2-1022) 
(1-2-52) 


Number (maxi 


Denormalized 
00 
000 
000 ...001 
0 
(- l)S (2 - 126) (2 - 23) 
(-1)s 
(2 -1022) 
(2 - 52) 


Number (min) 


Wrapped 
00 
000 
All l's 
1 
l-l)S 
12-127)12_2-23) 
( - l)S (2 - 1023) (2 - 2 - 52) 


Number Imax) 


Wrapped 
EA 
7eD 
All D's 
1 
1-1)s 
(2-(22+127)) 
(1) 
(-1)s 
(2-(51 
+1023)) 
(1) 


Number (min) 


Zero 
00 
000 
Zero 
0 
(-l)S 
10.0) 
l-l)S 
(0.0) 


Infinity 
FF 
7FF 
Zero 
1 
1- l)S (infinity) 
(- l)S (infinity) 


NaN (Not a 
FF 
7FF 
Nonzero 
N/A 
None 
None 
Number) 


status 
outputs 


Status flags are provided to signal both floating-point 
and integer results. Integer status is provided using 
AEQB for zero, NEG for sign, and OVER for overflow/carryout. 


Status exceptions 
can result from one or more error conditions 
such as overflow, 
underflow, 
operands 
in illegal formats, 
invalid operations, 
or rounding. 
Exceptions may be grouped into two 
classes: input 
exceptions 
resulting from invalid operations or denormal inputs to the multiplier, 
and output exceptions 
resulting from illegal formats, 
rounding errors, or both. 


architecture 
overview 


The' ACT884 7 is a high-speed floating-point 
unit implemented in TI's advanced O.8-jlm 
CMOS technology. 


The device is fully compatible with IEEEStandard 754-1985 for addition, subtraction, multiplication, division, 
square root, and comparison. 


The' ACT884 7 FPU also performs integer arithmetic, 
logical operations, and logical shifts. Absolute value 
conversions, 
floating-point 
to integer conversions, 
and inte.ger to floating-point 
conversions 
are also 
available. The ALU and multiplier are both included in the same device and can be operated in parallel to 
perform sums of products and products of sums (see the functional 
block diagram). 


IEEEformatted 
denormal numbers are directly handled by the ALU. Denormal numbers must be wrapped 
by the ALU before being used in multiplication, 
division, or square root operations. A fast mode in which 
all denormals are forced to zero is provided for applications 
not requiring gradual underflow. 


The' ACT884 7 input buses can be configured to operate as two 32-bit data buses or as a single 54-bit 
bus, providing a number of system interface options. 
Registers are provided at the inputs, outputs, 
and 
inside the ALU and multiplier 
to support 
multilevel 
pipelining. 
These registers 
can be bypassed 
for 
nonpipelined operation. 


A clock mod'~ control allows the temporary 
input register to be clocked on the rising edge or the falling 
edge of the clock to support double-precision ALU operations at the same rate as single-precision operations. 
A feedback register (C register) with a separate clock is provided for temporary internal storage of a multiplier 
result, ALU result or constant. 
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Four multiplexers 
select the multiplier 
and AlU 
operands from the input registers, C register, or previous 
multiplier, 
or AlU 
result. Results are output on the 32-bit Y bus; a Y output 
multiplexer 
selects the most 


significant 
or least significant 
half of the result if a double-precision 
number is being output. 


To ensure data integrity, 
parity checking 
is performed 
on input data, and parity is generated for output 
data. A master/slave 
comparator 
supports fault-tolerant 
system design. Two test pin control inputs allow 


alii/Os 
and outputs to be forced high, low, or placed in a high-impedance 
state to facilitate 
system testing. 


pipeline controls 


Six data registers in the' ACT884 7 are arranged in three levels along the data paths through the multiplier 
and the AlU. 
Each level of registers can be enabled or disabled independently 
of the other two levels by 


setting the appropriate PIPES2-PIPESOinputs. When enabled, data is latched into the register on the rising 
edge of the system 
clock 
(ClK). 
A separate 
instruction 
pipeline 
register 
stores 
the 
instruction 
bits 


corresponding 
to the operation 
being executed 
at each stage. 


The levels of pipelining are shown in Figure 9. The first set of registers, the RA and RB input registers, 
are controlled 
by PIPESO. These registers may be used as inputs to the AlU, 
multiplier, 
or both. 


The pipeline registers are the second register set. When enabled by PIPES1,these registers latch intermediate 
values in the multiplier 
or AlU. 


The results of the AlU 
and multiplier 
operations 
may optionally 
be latched into two 
output 
registers 
by 


setting PIPES210w. The P (product) register holds the result of the multiplier operation; the S (sum) register 
holds the AlU 
result. 
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Table 3 shows the settings of the registers controlled by PIPES2-PIPESO.The instruction 
pipeline registers 
are set accordingly. 


PIPES2-PIPESO 
REGISTEROPERATION SELECTED 


X 
X 
0 
Enables input registers IRA. RB) 


X 
X 
, 
Makes input registers IRA. RB) transparent 


X 
0 
X 
Enables pipeline registers 


X 
, 
X 
Makes pipeline registers transparent 


0 
X 
X 
Enables output registers IPREG. SREG. Status) 
, 
X 
X 
Makes output registers IPREG. SREG, Status) transparent 


When all registers (except the C register) are enabled, timing 
constraints 
can become critical 
for many 
double-precision 
operations. In clock mode 1, the AlU can perform a double-precision 
operation and output 
a result during every clock cycle, and both halves of the result must be read out before the end of the 
next cycle. 
Status outputs 
are valid only for the period during which the Y output 
data is valid. 


Similarly, double-precision 
multiplication 
is affected by pipelining, clock mode, and sequence of operations. 


A double-precision multiply may require two cycles to execute and two cycles to output the result, depending 
on the settings 
of PIPES2-PIPESO. 


Duration of valid outputs 
at the Y multiplexer 
depends on settings of PIPES2-PIPESOand ClKMODE, 
as 
well as whether 
all operations 
and operands are of the same type. For example, when a double-precision 
multiply 
is followed 
by a single-precision 
operation, one clock cycle must intervene between the dissimilar 
operations. 
The instruction 
inputs are ignored during this clock cycle. 


temporary input register 


A temporary 
input register is provided to enable loading of two 
double-precision 
numbers on two 
32-bit 
input buses in one clock cycle. The contents of the DA bus are loaded into the upper 32 bits of the temporary 
register; 
the contents 
of DB are loaded into the lower 32 bits. 


A clock mode signal (ClKMODE) determines the clock edge on which the data will be stored in the temporary 
register. When ClKMODE 
is low, data is loaded on the rising edge of the clock. With ClKMODE 
set high, 
the temporary 
register loads on a falling edge and the RA and RB registers can then be loaded on the next 
rising edge. The temporary 
register loads during every clock cycle. 


RA and RB input registers 


Two 54-bit registers, RA and RB, are provided to hold input data for the multiplier and AlU. 
Data is taken 
from the DA bus, DB bus, and the temporary 
input register. The registers are loaded on the rising edge 
of ClK 
if the enables ENRA and ENRB are set high. PIPESO must be low. 


Data input combinations 
to the' ACT884 7 vary depending on the precision of the operands and whether 
they are being input as A or B operands. loading 
of external data operands is controlled 
by the settings 
of ClKMODE 
and CONFIG 1-CONFIGO, which determine the clock timing for loading and the registers that 
are used. (See Figure 10). 


configuration controls 


Three input registers are provided to handle input of data operands, either single precision or double precision. 
The RA. RB, and temporary 
registers are each 54 bits wide. The temporary 
register is (ordinarily) 
used 
only during input of double-precision 
operands. 


Double-precision 
operands are loaded by using the temporary 
register to store half of the operands prior 
to inputting the other half of the operands on the DA and DB buses. As shown in Table 4, four configuration 
modes for selecting 
input sources are available for loading data operands into the RA and RB registers. 
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TEMPORARY REGISTER 
MSH 
I 
LSH 


LOADING SEQUENCE 


DATA LOADED INTO TEMP 
DATA LOADED INTO RAJRB 
REGISTERON FIRST CLOCK 
REGISTERSON SECOND 
AND RAJRB REGISTERSON 
CLOCK 
SECOND CLOCKt 


CONFIG1 
CONFIGO 
DA 
DB 
DA 
DB 


0 
0 


B operand 
B operand 
A operand 
A operand 


(MSH) 
(LSH) 
(MSH) 
(LSH) 


0 
1 
A operand 
B operand 
A operand 
B operand 


(LSH) 
(LSH) 
(MSH) 
(MSH) 


1 
0 


A operand 
B operand 
A operand 
B operand 


(MSH) 
(MSHI 
(LSH) 
ILSHI 


1 
1 
A operand 
A operand 
B operand 
B operand 


(MSH) 
(LSH) 
(MSH) 
ILSH)) 


t On the first active clock edge (see Clock Mode Settings), data in this column is loaded into 
the temporary register. On the next rising edge, operands in the temporary register and the 
DAIDB buses are loaded into the RA and RB registers. 


When single-precision 
or integer operands are loaded, the ordinary setting of CONFIG1-CONFIGO is 01, 


as shown 
in Table 5. This setting 
loads each 32-bit 
operand in the most significant 
half (MSH) of its 
respective register. Single-precision operands are loaded into the MSHs and adjusted to double precision 
because the data paths internal to the device are all double precision. 
It is also possible to load single- 


precision operands with other CONFIG settings but two clock edges are required to load both the A and 
B operands on the DA bus, The operands are input as the MSHs of the A and B operands (see Table 4). 
For example, to load single-precision 
operands using CONFIG1-CONFIGO = 10, the A and B operands are 


input one active clock edge before the instruction. 
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DATA 
LOADED INTO 


RA/RB REGISTERS ON 


FIRST CLOCK 


CONFIG1 
CONFIGO 
DA 
DB 
NOTE 


0 
1 
A operand 
8 operand 
This mode is ordinarily 
used for single- 


precision 
operations. 


clock mode settings 


Timing of double-precision data inputs is determined by the clock mode setting, which allows the temporary 
register 
to 
be loaded 
on either 
the 
rising 
edge 
(CLKMODE = 0) or the 
falling 
edge 
of 
the 
clock 


(CLKMODE = 1). Since the temporary register is not used when single-precision operands are input, clock 
modes 0 and 1 are functionally 
equivalent for single-precision 
operations using CONFIG1-CONFIGO = 01. 


The setting 
of CLKMODE can be used to speed up the loading of double-precision 
operands. When the 


CLKMODE input is set high, data on the DA and DB buses are loaded on the falling edge of the clock into 
the MSH and LSH, respectively, 
of the temporary 
register. On the next rising edge, contents 
of the DA 


bus, DB bus, and temporary 
register are loaded into the RA and RB registers, and execution of the current 


instruction 
begins. The setting of CONFIG 1-CONFIGO determines the exact pattern in which operands are 


loaded, whether 
as MSH or LSH in RA or RB. 


Double-precision 
operation in clock mode 0 is similar except that the temporary 
register loads only on a 
rising edge. For this reason, the RA and RB registers do not load until the next rising edge, when all operands 
are available and execution 
can begin. 


A considerable advantage in speed can be realized by performing double-precision operations with CLKMODE 
set high. In this clock mode, both double-precision 
operands can be loaded on successive 
clock edges, 
one falling and one rising. If the instruction 
is an ALU operation, 
then the operation 
can be executed 
in 


the time from one rising edge of the clock to the next rising edge. Both halves of a double-precision 
ALU 


result must be read out on the Y bus within one clock cycle when the' ACT884 7 is operated in clock mode 1. 


The discussion above assumes that the system is able to furnish two sets of operands in one cycle (one 
set on the falling edge of the clock and the other set on the next rising edge). This assumption 
may not 


be valid, since the system 
is required to "double 
pump" 
the input data buses. 


Even for a system that is not able to double pump the input data buses, using clock mode 1 can reduce 
microcode 
size substantially 
resulting in increased system throughput. 
To illustrate, 
take the case of an 
operation where the operand(s) are furnished by one or more of the feedback registers (refer to Table 6). 
Since the input data buses are not being used to furnish the operands, the data on the buses at the time 
of the instruction 
is unimportant. 
By setting 
CLKMODE high, the instruction 
begins after the first cycle, 
resulting 
in a savings of one cycle. 
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TABLE 
6(a) DOUBLE-PRECISION 
CREG + PREG 
USING 
CLKMODE 
= 0, PIPES2-0 
= 010 


DA 
DB 
TEMP 
INSTR 
RA 
RB 
S 
CYCLE 
ClKMODE 
BUS 
BUS 
REG 
BUS 
REG 
REG 
REG 


1 
0 
X 
X 
X 
C + P 
X 
X 
X 


2 
0 
X 
X 
X 
C + P 
X 
X 
X 


3 
X 
X 
X 
X 
X 
X 
X 
C + P 


TABLE 
6(b) 
DOUBLE-PRECISION 
CREG + PREG 
USING 
CLKMODE 
= 0, PIPES2-0 
= 010 


DA 
DB 
TEMP 
INSTR 
RA 
RB 
S 
CYCLE 
ClKMODE 
BUS 
BUS 
REG 
REG 
REG 
REG 
BUS 


1 
1 
X 
X 
X 
C + P 
X 
X 
X 


2 
X 
X 
X 
X 
X 
X 
X 
C + P 


Going one step further, take the case of an operation where only one operand needs to be furnished by 
the input data buses (refer to Table 7). To take advantage of clock mode 1, set the CONFIG lines so that 
the external operand comes directly from the DA and DB bus, as opposed to coming from the temporary 
register. 
Since the temporary 
register is not used to provide an operand, the data latched 
into it is 
inconsequential. 
It naturally 
follows 
then that 
the clock edge used to load the temporary 
register 
is 
unimportant. 
So by setting CLKMODE high, a double-precision instruction will begin after one cycle, instead 
of two cycles. 


TABLE 
7(a) DOUBLE-PRECISION 
PREG + RB 
USING 
CLKMODE 
= 0, PIPES2-0 
= 010 


DA 
DB 
TEMP 
INSTR 
RA 
RB 
S 
CYCLE 
ClKMODE 
BUS 
BUS 
REG 
BUS 
REG 
REG 
REG 


1 
0 
X 
X 
X 
P + RB 
X 
X 
X 


2 
0 
RBIM) 
RBll) 
RB 
P + RB 
X 
RB 
X 


3 
X 
X 
X 
X 
X 
X 
X 
P + RB 


TABLE 
7(b) 
DOUBLE-PRECISION 
PREG + RB 
USING 
CLKMODE 
= 1, PIPES2-0 
= 010 


DA 
DB 
TEMP 
INSTR 
RA 
RB 
S 
CYCLE 
ClKMODE 
REG 
REG 
REG 
BUS 
BUS 
REG 
BUS 


1 
1 
RBIM) 
RBll) 
RB 
P + RB 
X 
RB 
X 


2 
X 
X 
X 
X 
X 
X 
X 
P + RS 


operand selection 


Four multiplexers select the multiplier and ALU operands from the RA and RBregisters, the previous multiplier 
or ALU 
result, 
or the 
C register 
(see Figure 11I. The multiplexers 
are controlled 
by input 
signals 
SELOP7-SELOPOas shown in Tables 8 and 9. For division and square root operations, operands must be 
sourced from the input registers RA and RB. 
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FROM 
C REGISTER- 
- 
FROM PRODUCT - 
- 
REGISTER 


SElOP7-6 


A1 (MUX11INPUT 
S1 (MUX2) INPUT 


SElOP7 
SElOP6 
OPERAND SOURCE' 
SElOP5 
SElOP4 
OPERAND SOURCE' 
0 
0 
Reserved 
0 
0 
Reserved 


0 
1 
C register 
0 
1 
C register 


1 
0 
AlU 
feedback 
1 
a 
Multiplier feedback 


1 
1 
RA input register 
1 
1 
RS input 
register 


A1 (MUX11INPUT 
S1 (MUX2) INPUT 


SElOP7 
SElOP6 
OPERAND SOURCE' 
SElOP5 
SElOP4 
OPERAND SOURCE' 


a 
0 
Reserved 
a 
a 
Reserved 


a 
1 
C register 
a 
1 
C register 


1 
0 
Multiplier feedback 
1 
0 
AlU 
feedback 


1 
1 
RA input 
register 
1 
1 
RB input register 


As shown in Tables 8 and 9, data operands can be selected from five possible sources, including external 
inputs from the RA and RB registers, feedback from the P (Product) and S (Sum) registers, and a stored 
value in the C register. Contents of the C register may be selected as either the A or the B operand in 
the ALU, the multiplier, 
or both. When an external input is selected, the RA input always becomes the 
A operand, and the RB input is the B operand. 


Feedback from the ALU can be selected as the A operand to the multiplier or as the B operand to the ALU. 
Similarly, multiplier feedback may be used as the A operand to the ALU or the B operand to the multiplier. 
During division or square root operations, operands may not be selected except from the RA and RB input 
registers (SELOP7-SELOPO = 11111111 I. 
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Selection of operands also interacts with the selected operation in the ALU or the multiplier. ALU operations 
with one operand are performed only on the A operand (with the exception of the Pass B operation). 
Also, 
depending on the instruction 
selected, the B operand may optionally 
be forced to zero in the ALU or to 
one in the multiplier. 


If an operation uses one or more feedback registers as operands, the unused busIes) can be used to preload 
operand(s) for a later operation. 
The data is loaded into the RA or RB input register(s); 
when the data is 
needed as an operand, the SELOPS pins are set to select the RA or RB register(s), 
but the register input 
enables (ENRA, ENRB) are not enabled. The one restriction 
on preloading data is that the operation 
being 
performed during the preload MUST use the same data type (single-precision, 
double-precision, 
or integer) 
as the data being loaded. Operands cannot 
be preloaded within 
square root or divide instructions. 


C register 


The 54-bit constant 
(Cl register is available for storing the result of an ALU or multiplier operation 
before 
feedback to the multiplier 
or ALU. The C register has a separate clock input (CLKC), input source select 
(SRCC), and write 
enable (ENRC, active low). 


The C register loads from the P or the S register output, 
depending on the setting 
of SRCC. SRCC = 1 
selects the multiplier 
as the input source. Otherwise, 
the ALU is selected when SRCC = 0. The SRCC 
input is not registered with the instruction 
inputs. Depending on the operation 
selected and the settings 
of PIPES2-PIPESO, an offset 
of one or more cycles may be necessary to load the desired result into the 
C register. The register only loads on a rising edge of CLCK when ENRC is low. 
(See Figure 12). 


A separate control 
(FLOWC) is available to bypass the C register when feeding an operand back on the 
C register feedback 
bus. When FLOWC is high, the output 
of the P or S register (as selected by SRCC) 
bypasses the C register without 
affecting 
the C register's 
contents. 
Direct P or S feedback 
is unaffected 
by the FLOWC setting. 


-140------------ 
tdt 
I 
--f 


I 
I 
I 
I 


I!' 
11 


~ 
I 
I 
\--,---- 


The pipelined ALU contains 
a circuit 
for floating-point 
addition 
and/or subtraction 
of aligned operands, 
a pipeline register, an exponent 
adjuster and a normalizer/rounder 
as shown in Figure 13. An exception 
circuit is provided to detect denormal inputs; these can be flushed to zero if the FAST input is set high. 
If the FAST input is low, the ALU accepts a denormal as input. A denorm exception 
flag (DENORM) goes 
high when the ALU output 
is a denormal. 


Integer processing in the ALU includes both arithmetic 
and logical operations on either two's 
complement 
numbers or unsigned integers. The ALU performs addition, subtraction, 
comparison, 
logical shifts, logical 
AND, logical OR. and logical XOR. 
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The ALU may be operated independently 
or in parallel with the multiplier. 
Possible ALU functions 
during 


independent 
operation 
are given in Table 10. 


PREALIGNMENT 
--r-- 


INTEGER ALU 
~-r-~ 


NORMALIZER 
~-r-~ 


SINGLE OPERAND 
TWO OPERANDS 


Pass 
Add 


Move 
Subtract 


Format Conversions 
Compare 


Wrap Denormalized Number 
AND 


Unwrap 
OR 


Shift 
XOR 


pipelined multiplier 


The pipelined multiplier 
(see Figure 14) performs a basic multiply function. 
division and square root. The 


operands can be single-precision or double-precision floating-point numbers and can be converted to absolute 
values before 
multiplication 
takes 
place. Integer operands 
may also be used. Independent 
multiplier 


operations 
are summarized in Table 11. 
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If the operands to the multiplier are double precision or mixed precision (ie. one single precision and one 
double precision), then one extra clock cycle is required to get the product through the multiplier pipeline. 
This means that for PIPES1 = 1, one clock cycle is required for the multiplier 
pipeline; for PIPES1 = 0, 
two 
clock cycles are required for the multiplier 
pipeline. 
1 
RECORDER 
--1-- 


CONVERTER 
--1-- 


ROUNDER 
--1-~ 


TWO OPERANDS 


Multiply 


Divide 


An exception circuit is provided to detect denormalized inputs; these are indicated by a high on the DENIN 
signal. Denormalized inputs must be wrapped by the ALU before multiplication, 
division, or square root. 
If results are wrapped (signaled by a high on the DENORM status pin), they must be unwrapped by the ALU. 


The multiplier and ALU can be operated simultaneously 
by setting the 110 instruction 
input high. Division 
and square root are performed as independent multiplier operations, even though both multiplier and ALU 
are active during divide and SORT operations. 


data output controls 


Selection and duration of results from the Y output multiplexer may be affected by several factors, including 
the operation selected, precision of the operands, registers enabled, and the next operation to be performed. 
The data output 
controls 
are not registered 
with 
the data and instruction 
inputs. 
When the device is 
microprogrammed, 
the effects 
of pipelining and sequencing of operations should be taken into account. 


Two particular conditions 
need to be considered. Depending on which registers are enabled, an offset of 
one or more cycles must be allowed before a valid result is available at the Y output 
multiplexer. 
Also, 
certain sequences of operations may require both halves of a double-precision 
result to be read out within 
a single clock cycle. This is done by toggling 
the SELMS/LS signal in the middle of the clock period. 


When a single-precision 
result is output, 
the SELMS/LS signal has no effect. 
The SELMS/LS signal is set 
low only to read out the LSH of a double-precision 
result (see Figure 15). To read out a result on the Y 
bus, the output 
enable OEY must be low. OEY is an asynchronous 
signal. 
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FROM 
INSTRUCTION 
REGISTER 


parity checker/generator 


When BYTEP is high, internal even parity is generated for each byte of input data at the DA and DB ports 
and compared to the PA and PB parity inputs, respectively. 
If an odd number of bits is set high in a data 
byte, a parity check can also be performed on the entire input data word by setting 
BYTEP low. In this 
mode, PAO is the parity input for DA data and PBO is the parity input for DB data. 


Even parity is generated for the Y multiplexer 
output, 
either for each byte or for each word of output, 
depending on the setting of BYTEP. When BYTEP is high, the parity generator computes four parity bits, 
one for each byte of the Y multiplexer output. 
Parity bits are output on the PY3-PYO pins; PYO represents 
parity for the least significant 
byte. A single parity bit can also be generated for the entire output 
data 
word ~y setting 
BYTEP low. 
In this mode, PYO is the parity output. 


master/slave comparator 


A master/slave comparator is provided to compare data bytes from the Y output multiplexer and the status 
outputs with data bytes on the external Y and status ports when DEY, DES and DEC are high. If the data 
bytes are not equal, a high signal is generated on the master/slave 
error output 
pin (MSERR). 


Figure 16 shows an example master/slave 
circuit. 
Two' 
ACT884 7 slave devices verify the data/status 
integrity 
of the 'ACT884 7 master. 


status and exception generation 


A status and exception 
generator produces several output 
signals to indicate invalid operations 
as well 
as overflow, 
underflow, 
non-numerical 
and inexact results, in conformance 
with 
IEEE Std 754-1985. 
If 
output 
registers are enabled (PIPES2= 0). status and exception 
results are latched in the status register 
on the rising edge of the clock. Status results are valid at the same time as associated data results are valid. 


Duration and availability 
of status results are affected 
by the same timing constraints 
that apply to data 
results on the Y bus. Status outputs are enabled by two signals, DEC for comparison status and DES for 
other status and exception 
outputs. 
Status outputs 
are summarized 
in Tables 12 and 13. 
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0lV 
8847 


(SLAVEI 
nn:= 
STATUS 
Y 


SIGNAL 
RESULT OF COMPARISON (ACTIVE HIGH) 


AEQB 
The A and B operands are equal. A high signal on the AEQB output indicates a 


zero result from the selected source except during a compare operation in the 


ALU. During integer operations, indicates zero status output. 


AGTB 
The A operand is greater than the B operand. 


UNORD 
The two inputs of a comparison operation are unordered. i.e., one or both of 


the inputs is a NaN. 


During a compare operation in the ALU, the AEQB output goes high when the A and B operands are equal. 
When any operation other than a compare is performed, 
either by the ALU or the multiplier, the AEQB 


signal is used as a zero detect. 


COMPARISON 
STATUS OUTPUT 
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SIGNAL 


CHEX 


DIVBYO 


ED 


INEX 


INF 


NEG 


OVER 


RNDCO 


STEXO 


STEX1 


UNDER 


STATUS RESULT 


If 16is low, indicates the multiplier is the source of an exception during a chained 


function. If 16is high, indicates the ALU is the source of an exception during a 


chained function. 


Input to the multiplier is a denorm. When DENINgoes high, the STEX pins indicate 


which port had the denormal input. 


The multiplier output is a wrapped number or the ALU output is a denorm. In the 


FAST mode, this condition causesthe result to go to zero. It also indicates an invalid 


integer operaion, i.e., PASS (-A) with unsigned integer operand. 


An 
invalid 
operation 
involving 
a zero 
divisor 
has 
been 
detected 
by the 
multiplier. 


Exception detect status signal representing logical OR of all enabled exceptions 


in the exception disable register. 


The 
result 
of an operation 
is not 
exact. 


The output is the IEEErepresentation of infinity. Not valid during float-to-integer 


conversions. 


A NaN has been input to the multiplier or the ALU, or an invalid operation 


1(0' 
00) or (+ 
00 - 
00) or (- 
00 + 00)] has been requested. This signal also goes high 


if an operation involves the square root of a negative number. When IVAL goes 


high, the STEX pins indicate which port had the NaN. 


Output value has negative sign. 


The result is greater than the largest allowable value for the specified format. 


The 
mantissa 
of a number 
has 
been 
increased 
in magnitude 
by rounding. 
If the 


number 
generated 
was 
wrapped, 
then 
the 
unwrap 
round 
instruction 
must 
be used 


to properly unwrap the wrapped number (see Table 6). 


The status was generated by the multiplier. (When SRCEXis low, the status was 


generated by the ALU.) 


A NaN or a denorm has been input on the 8 port. 


A NaN or a denorm has been input on the A port. 


The 
result 
is inexact 
and 
less than 
the 
minimum 
allowable 
value 
for the 
specified 


format. In the FAST mode, this condition causes the result to go to zero. 


In chained mode, results to be output are selected based on the state of the 16 (source output) 
pin (if 16 


is low, ALU status will be selected; if 16is high, multiplier status will be selected). If the nonselected output 
source generates an exception, 
CHEX is set high. Status of the nonselected output source can be forced 
using the SELST pins, as shown in Table 14. 


SELST1- 


SELSTO 
STATUS SELECTED 


00 
Logical OR of ALU and multiplier exceptions (bit by bit) 


01 
Selects multiplier status 


10 
Selects ALU status 


11 
Normal operation (selection based on result source specified by 16 input) 
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An exception 
detect 
mask register is available to mask out selected 
exceptions 
from the multiplier, 
ALU, 


or both. Multiply status is disabled during an independent ALU instruction, 
and ALU status is disabled during 


multiplier 
instructions. 
During chained 
operation, 
both status 
outputs 
are enabled. 


When the exception 
mask register has been loaded with 
a mask, the mask is applied to the contents 
of 


the status register to disable unnecessary 
exceptions. 
Status results for enabled exceptions 
are then ORed 


together 
and, if true, the exception 
detect 
(ED) status 
output 
pin is set high (see Figure 18). Individual 


status 
outputs 
remain active 
and can be read independently 
from 
mask register 
operations. 


EXCEPTION 
DETECT MASK 


MULTIPLIER 
AlU 


ED 


FIGURE 18. EXCEPTION 
DETECT 
MASK 
LOGIC 
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absolute maximum ratings over operating free-air temperature range (unless otherwise noted)t 


Supply voltage, 
VCC (see Note 1) 
-0.5 V to 6 V 


Input clamp current, 
11K (VI < 0 or VI > VCC) . . . . . 
± 20 mA 
Output 
clamp current, 
10K 
(VO < 0 or Vo > VCC) 
. 
± 50 mA 


Continuous 
output 
current, 
10 (VO = VCC). 
. . . . . . . 
± 50 mA 


Continuous 
current through 
VCC or GND pins. 
. • . . . 
± 100 mA 


Operating free-air temperature 
range. 
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .. 
0 DC to 70 DC 


Storage temperature 
range 
- 65 DC to 150 DC 


tStresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings 
only and functional operation of the device at these or any other conditions beyond those indicated under "recommended operating 
conditions" 
is not implied. Exposure to absolute-maximum-rated conditions for extended periods may affect device reliability. 
Note 1: All voltage levels are with respect to GND. 


PARAMETER 
MIN 
NOM 
MAX 
UNIT 


VCC 
Supply voltage 
4.75 
5.0 
5.25 
v 


VIH 
High-level input voltage 
2 
VCC 
V 


VIL 
Low-level input voltage 
0 
0.8 
V 


IOH 
High-level output current 
-8 
mA 


IOL 
Low-level output current 
8 
mA 


VI 
Input voltage 
0 
VCC 
V 


Vo 
Output voltage 
0 
VCC 
V 


dt/dv 
Input 
transition 
rise or fall rate 
0 
15 
nslV 


TA 
Operating free-air temperature 
0 
70 
°c 
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electrical characteristics 
over recommended operating free-air temperature range (unless otherwise 
noted) 


PARAMETER 
TEST CONDITIONS 
VCC 
TA - 
25°C 
MIN 
TYP 
MAX 
UNIT 
MIN 
TYP 
MAX 


4.75 V 
4.65 


IOH = 
-20 
p.A 
5.25 V 
5.15 


VOH 
V 
4.75 V 
3.95 
3.85 


10H = 
-8 
mA 
5.25 V 
4.7 
4.6 


4.75 V 
0.1 
10L = 20 p.A 
5.25 V 
0.1 
VOL 
V 
4.75 V 
0.32 
0.45 
10L = 8 mA 
5.25 V 
0.32 
0.45 


VI = VCC or 0, 
5.25 V 
±100 
'ACT8847-30 


II 
VI = VCC or 0, 


p.A 


5.25 V 
±5 
,ACT884 7-40 


VI = VCC or 0, 10 
5.25 V 
±100 
,ACT884 7-30 


10l 
VI = VCC or 0, 10 


p.A 


5.25 V 
±10 
,ACT884 7-40 


VI = VCC or 0, 10 
5.25 V 
10 


ICCO 


'ACT8847-30, 
'ACT8847S-30 
mA 


VI = VCC or 0, 10 
5.25 V 
0.2 
•ACT884 7-40 


CI 
VI = VCC or 0 
5V 
10 
10 
pF 
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switching characteristics over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise 
noted) 


PIPELINE 
FROM 
TO 
SN74ACT8847-30 
SN74ACT8847-40 


NO. 
PARAMETER 
CONTROLS 
UNIT 
(INPUT) 
(OUTPUT) 
PIPES2-PIPESO 
MIN 
MAX 
MIN 
MAX 


INPUT REG 
Y OUTPUT 
110 
66 
90 
2 
tpd2 
ns 
INPUT REG 
STATUS 
110 
66 
90 


PIPELN REG 
Y OUTPUT 
lOX 
48 
60 
3 
tpd3 
ns 
PIPELN REG 
STATUS 
lOX 
48 
60 


OUTPUT REG 
Y OUTPUT 
OXX 
20 
24 
4 
tpd4 
ns 
OUTPUT REG 
STATUS 
OXX 
20 
24 


5 
tnd5 
SELMS/LS 
Y OUTPUT 
XXX 
13 
20 
ns 


Y OUTPUT 


6 
tpd6 
CLK; 
all but 111 
4 
3 
ns 
INVALID 


STATUS 
7 
tpd7 
CLKl 
all but 111 
2 
3 
ns 


INVALID 


SELMS/LS 


Y OUTPUT 
8 
tpd8 
XXX 
1 .5 
1.5 
ns 
INVALID 


CLK; 
CLK; 
010 w/o feedback 
56 
72 


9 
tdl 
CLK; 
CLK; 
010 w/feedback; 
56 
72 


CLK; 
CLK; 
010 
W/FLOWCt 
66 
84 


CLK; 
CLK; 
000 w/o feedback 
30 
40 
ns 


10 
td2 
CLK; 
CLK; 
000 w/feedback t 
30 
40 


CLK; 
CLK; 
000 WIFLOWCt 
40 
47 


Delay time, CLKC after CLK to insure data 


11 
td3 
captured in C register is data clocked into 
14 
td-O§ 
12 
td-O§ 
sum or product 
register 
by that 
clock. 
ns 


(PIPES2-PIPESO ~ OXXI 


12 
ten 
OEY 
Y OUTPUT 
XXX 
14 
16 


13 
ten2 
OEC, OES 
STATUS 
XXX 
14 
16 
ns 
14 
tdisl 
OEY 
Y OUTPUT 
XXX 
14 
16 


15 
tdis2 
OEC, OES 
STATUS 
XXX 
14 
16 


t Applies to all cases except where operands are fed back using FLOWC to bypass C register. 
tOperands 
are fed back using FLOWC to bypass the C register. 


§t<j is the clock cycle period. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


timing requirements over recommended ranges of supply voltage and operating free-air temperature 
(unless otherwise noted) 


setup and hold times 


PIPELINE 
SN74ACT8847-30 
SN74ACT8847-40 
NO. 
PARAMETER 
CONTROLS 
UNIT 


PIPES2-PIPESO 
MIN 
MAX 
MIN 
MAX 


16 
tsu1 
Instfcontrol 
before CLK! 
XXO 
12 
14 


17 
tsu2 
OAf DB before CLK! 
XXO 
11 
13 


1B 
tsu3 
OAfDB before 2nd CLK! lOP) 
XX1 
40 
52 


19 
CONFIG1-0 before CLK! 
XXO 
12 
ns 


tsu4 
14 


20 
tsu5 
SRCC before CLKC! 
XXX 
14 
14 


21 
tsu6 
RESET before CLK! 
XXO 
12 
14 


22 
th1 
Instfcontrol 
after CLK! 
XXX 
3 
3 


23 
th2 
OAfDB after CLK! 
XXX 
3 
3 


24 
SRCC after CLKC! 
XXX 
3 
ns 


th3 
3 


25 
tM 
RESET after CLK! 
XXO 
6 
6 


SN74ACTB847-30 
SN74ACT8847-40 
PARAMETER 
UNIT 
MIN 
MAX 
MIN 
MAX 


CLK high 
10 
15 


tw 
Pulse duration 
CLK low 
10 
15 
ns 


RESET 
10 
12 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT FLOATING-POINT UNIT 


TEST 
51 


FROMOUTPUT~ 


UNDER TEST 1~.0 


T 


Cl 


TIMING 
Cll 
Vl 
51 
PARAMETER 
IOl 
IOH 


ten 
~ 
50 pF 
1 mA 
-1 
mA 
1.5 V 
CLOSED 


tPlH 


tdis 
~ 
50 pF 
16 mA 
-16 
mA 
1.5 V 
CLOSED 


tplZ 


tpd 
50 pF 
- 
- 
- 
OPEN 


t CL includes 
probe and test fixture 
capacitance. 


NOTE: 
All 
input 
pulses 
are 
supplied 
by 
generators 
having 
the 
following 
characteristics: 


PRR s 1 MHz, 
Zo 
= 50 
0, tr S 6 ns, 
tf 
S 6 ns. 


CLOCKS 
ClK.CLKC 
I 
I 


DATA INPUT {DA31'O 
: 
opo: 
OP2 MSH 


i 
, 


BUSES 
1"'--17 
1-23 
I 
23 


0831.0~, 
OP1 
OP2lSH, 
'-- 
, 
~ 
~_--~_---_---_- 


~ 
:. 
: 
CONFIG1.0 
~ 
CONFIG1,O-OO 
)( 


, 
':: 
: 
~ 
••....22 
I 
ptPES2-0 
~ 
PIP£52·0-000 
l 
" 
ClKMODE :' 
: : 
I 
I 


I 
II 
It 
~' 
ENAA V 
:li 
I 
'}.. 


-16---41~22 
,i tt----16 
•... 22 
'r----'-; 
" 


ENA, 7 
,;::==:;: 
I " 


:' 
" 
I' 


INPUT 
REGISTER 
{RA ===x 
o~ 
~ 


CONTENTS 
I 
I 
J 
RS===X 
OP1 
~ 


{ 


SElOP7.Q 
I XXXX 
1 tl~ 
I 
SElOP7.0 
'. 
tOll 
XXX~I 
XXXX XXXX I 


INSTRUCTIONI 
,.- 
16 
I •... 22 
I "-'6 


CONTROlS 
"0·0 
"AISPI.ftIlISPJ 
fRA1SPI+RBIDP) 
NOP 
, 
' 
, 
' 
CONTROLS , 
, 


,,,,, 
I 
: 
I, 
i 


: 


: 
I 


I 
I' 
SElDP7·Q •• XXXX 1001 
SELOP1·Q •• xxxx 
OtXX 


22 
"-22 
I 
1 t-22 


PREGIDPI + CREGtSP) 
CONVERT DP TO $P 


: I:4:.------~---~- 


~ 
••.•. 22 
,,,,,,, 


NOTES: 
Assume the following 
mixed precision operation. 
Single precision OPO+OPl 
=RA+RB 
- 
SUMl 
- 
CREG, where OPO is SP and OPl 
is SP. 
Mixed precision OP2.0P2 
= RA.RB - 
PRODUCT1, where OP3 is SP and OP2 is DP. 


NOP (must be inserted). 
Mixed precision 
IOP3·0P2) 
+ IOPO+OP1) 
= PREG+CREG - 
SUM2 IDP). and then convert 
to SP. 


Assume 
valid contro' 
signa's for FAST, HALT = 1, P'PES2-0 =000 
(fuliV piDelined model. 
RESET= 1, RND1-0, 
SElSTl-O 
= 11. TP1-0 = 11. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64·BIT FLOATING-POINT UNIT 


3" 
~ 
~'? 
, 
I 
I 
I 
I 
I 
I 


{ 


SRCC 
'i 
:I 
; 
i! 
I 
; 


: 


: "'-20=<j 
"-24 
, 
: ""-20-'" 
,....24 
, 


CREG CONTROLS 
~---~---~---~."''L :)' 
I 
I 
Iii 
I 


I --'64 
-- 22 
: I 
I I 
: : 
: : 
: 


FLOWC 
1 
:( 
1 
:\ 
_ 
___________________ 
~: 
:"-16~: .....22 
: 1.-'6-----! 
10=22 
:' 


{ 


MUlTPlPE 
~ 
. 
. 
INTERNAL 
REGISTER 
--- 
•.•.••.----,~--~,~--~----r', 
r~'_-------,----, 
--------- 


CONTENTS 
~l 
I 
I 
I 
'~ 


AlUPlPE 
•.•.••• 
•.•••~~--------------~----- 


{ 


CREG 
: 
I:X 
: 
SUM' 
'5.; 
: 
I 


[ 
':: 
I 
OUTPUT 
REGISTER 
PREG 
:X 
PRODucn 
lOP. X------------- 
CONTENTS 
: 
1 
: 
: 
~--~:---~---~:- 


SAEG 
SUM' (SPI 
5UM2 IOPI 
5UM2 lOP) 


, 
I 
, 
I 


I 
I 
[ 


IL.Jr--ii' 
, I 
I 
, 
I, 
' 
+ ..... 
5 
I 
'II 
.1: 
l 
+'j5 


; ! 
: 
l 
I 


PftOOVCTt 
MSH 
PRODUCTt 
lSH 
5UM2 
MSH 
I 5UM2 
LSH 
.-------.. 
I I 
I 


VALID 
STATUS 
! 


{ 


ll!Y 
:, 
: 


OUTPUT 
Y 
: r-r--, 2 
: 


CONTROLS 
SELMSJa 
•.•.••• 
•.••••• 
~,-~-~---~---~i,J7 


------ 
1 
: 
~.lSn 
1,\:: 


:: 
t 
I 
SUMllSPI 


{ 


Y3'-0 


OUTPUT 
BUSES 


STATUS 
,. 


VALID STATUS 


~4 
! 


NOTES: 
Assume the following mixed precision operation. 


Single precision OPO+OPl 
=-RA+RB 
- 
SUMl 
- 
CREG. where 
OPO is SP and OPl 
is SP. 


Mixed precision OP2.0P2 
~ RA.RB 
- 
PROOUCTl. 
where OP3 is SP and OP2 is OP. 


NOP (must be inserted!. 
Mixed precision 
IOP3.0P21 
+ IOPO+OPll 
~ PREG+CREG - 
SUM2 IOPI. and then convert 
to SP. 


Assume valid control 
signals for FAST. HALT ~ 1. PIPES2·0 ~OOO Ifully 
pipelined model. 
RESET~ 1. RND, -0. SELSTl-O ='1. 
TP1-0 ='1. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64·81T FLOATING-POINT 
UNIT 


OP2 MSH 
-" 


DPZ lSH 


, 
, 
" 


{ 


DA31 
0 
OPO LSH 
OPO MSH 


DATA INPUT 
~ 
17 
jIt- 23 
BUSES 
I 
oB31 
0 
OPl 
lSH 
OPl 
MSH 


I 
" 
CONFIG1.0 I"C::O:::N:::F:::'G::,'":.O:"".-;O::',--'v-C::O:::N:::F:::'G::''-:.O:"".-;O::;O--'v- 
__ 
"- 
'v 
J- 
J- 
J- 
.L- 
.L-_ 


:--,6 


PIPES2-Q 
I 


CLKMOOE 0 
I 
I 
I 
I 


-----l.fJ 
I I 
t 
I 


ENRA==,; 
1\ 
I 
I 


! 
i~' 
': 


ENRB 
:1 
l~ 
:\ 
---~.,..-.- 
16 --4Il 
,.....22 
""--'6 
--d-- 22 "- 


INPUT REGISTER { 
RA 
: 
~ 
~PO 
: 


CONTENTS 
: 
V 
! 
V 
RB 
~ 
?P' 
~ 


{ 


SELOP70 


INSTRUCTION' 
"00 
CONTROLS 


CONTROLS 
( 


"r---~---~---~---~- 
it{ 


-.., 
I"- 22 


I: 
I 
I 


: 


: 


SElOP7-0 
- 
1011 XXXX 
XXXX XXXl( 
SElOP7-0 
- 
XXXX 
1001 
.•...22 
'--'6- 
.- 22 


SREG.R8 
NOP 
PREG + CREG 
, 
, 


rAtiO 
CONTR?l 
ASSUMPTI?NS 
Is•• No,e: 


I 
I 
I 
I 


NOTES: 
Assume the following double precision operation. 


OP~ + OP1 ~ RA + RB - 
SUM1 - 
CREG 


IOPO + OP1) • OP2 = SREG • RB - 
PRODUCTl 


[(OP~ + OP1) • OP2}) + IOPO + 0..f.1.L~ PREG + CREG - 
SUM2 
__ 


Assume 
valid control 
signals for FAST, HALT ~ 1, PIPES2·0 ~OOO (fully pipelined model. 
RESET= 1, RND1·0, 
SELSTl·O= 
11, TP1·0 = 11. 


TEXAS 
• 
INSTRUMENTS 


SN74ACT8847 


64-BIT 
FLOATING-POINT 
UNIT 


PARAMETER 
MEASUREMENT 
INFORMATION 


10 
11 
I 
I 
I 
I 
I 
I 
I 
I 
I 
, 
I 
, 
I 
I 
I 
: 


: 
I 
11 
I 
I 
I 
I 
SRce 
I 
I 
I 
I 
:io- 20~~ 
2. 
I 
I 
, 
I 
: 
I 
I 
, 
I 
'""'~."'.{- 


I 
I 
'1. 
:t 
I 
I 
I 
I 


: 


:~161 
~ 22 
I 
I 
I 


: 
: 
: 


Flowe 
i~ 


:J' 
{MUlT PIPE 
~ ~ 22 


INTERNAL 


REGISTER 
CONTENTS 
:~ 
! 
! 
! 
: 
:~ 
: 
AlU 
PIPE 


I 
I 
I 
r 


: 
: 
ir=! 
! 
S~Ml 
: 
: 
: 
OUTPUT 
: 
: 
: 
: 
:X 
PR~DUCT1 
:X 
: 
: 
REGISTER CONTENTS 
PREG 


: 
:X 
: 
: 
:X 
: 
:X 
, 
>C 


SREG 
SUM1 
5UM2 


I 


" 


I 
, 
I 
I 
I 
:~ 
{ 
m 


I 
I~ 
I 
I 
I 
I 
I 


I 
: 
~12 
, 
, 
I 
I 
, 


: 
:\ 


, 
I 
I, 
, r---) 
, 
I 


OUTPUT 
CONTROL 
SELMSI~ 
, 


~ 


I 
Ir- 
A 


" 


, 
I 
, 
+i,.... 5 
, 
: 
-rr-s 
I 
, 
14t-- .. 
I 
, 
I 
=.lln 
I 
I 
I 
, 
I 
_..- 
5 
I 
I 


:: 
1 
I 
I' 


{ 
YJ10 
:1~ 
SU~llSH 
:~)..--l.' 
I 
! }~ 
;ODUCT1MSH 
'7'0OUCT ':S~ 
5UM2 
LSH 


OUTPUT 
BUSES 
" 
I 
~ 
....-4 
15 . 
,,, 


STATUS 
II 
VALID 
STATUS 
, 
VALID STATUS 
VALID STATUS 


-!.l"'13 


NOTES: 
Assume the following double precision operation. 


OPO + OPI 
= RA + RB - 
SUM' 
- 
CREG 


(OPO + OP1) • OP2 = SREG • RB - 
PROOUCTl 


((OPO + OPli 
• OP2)J + IOPO + QE!L= 
PREG + CREG - 
SUM2 
Assume 
valid control 
signals for FAST, HALT = I, 
PIPES2-0 =000 
(fuliV pipelined 
model. 
RESET = " 
RNO, -0, SELSTl-O = 11. TP1-0 = , 1. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
54·BIT 
FLOATING-POINT 
UNIT 


CONFIG1·0 
( 
: 
I 
I 


: 


P1PES2·0 
~ 
I 
I 


: 
: 


ClKMOOE 
1\ 
, 
I 
I 
I 
I 


I 
I 
I 
I 
I 
, 
~ 


I 
ENRA 
( 
,1\ 
: I; 
: 


I 
I 
I 


I 
I 
I 
~'6~r22 


I 
~ 
'6---., 
I 
22 -;o--tol 
I 
i 
I 
i 
i 
I 
\ 
ENRB V 
: 
' 
: 
I 
I 
I 
I 
I 
I 


RA 
:X 
K IC~NSTANT) 
;X 
OP2 
;X 
OP, 
;X 
OP6 
: 


RB 
: 
;X 
OPO 
:X 


OP, 
:X 
OP3 
:X 
OP5 
:X 
OP7 


SElOP7-0 
XXXX 
11 XX 
10--'6 


110-0 
PASS A 


1011 
XXXX 


22 


SREG 
• RD 


CONTROL !'------r------.-,------.--..:.::==,===:::...:.:::::.:r==='---,-----,------,r----- 
, 


. , 


NOTES: 
Assume the following 
single precision operations. 
(K • OP~) + OP1 = PAODUCT1 + OP1 - 
SUM1 


(K • OP21 + OP3 = PAODUCT2 + OP3 - 
SUM2 
(K • OP4) + OP5 = PAODUCT3 + OP5 - 
SUM3 
(K • OP61 + OP7 = PAODUCT4 + OP7 - 
SUM4 
Assume 
valid 
control 
signals 
for 
FAST, 
HALT=1, 
PIPES2-0=010, 
AESET=1, 
AND1-0, 
SELST1-0=11, 


TP1-0=11. 


TEXAS + 
INSTRUMENTS 


SN74ACT8847 
54·BIT FLOATING-POINT UNIT 


SElMli:S ( 
~---- 
----_----_----~----~~--_ 


im:.OES 


3 
4 
6 
1 
8 


I 
I 
I 
I 
I 


I 
I 
I 
I 
I 
I 


: I 
') 
, 
i 
i 
i 
I 
L20 =:::oi 
10--24 
I 


11. 
:) 
I 
I 
I 
I 


I 
I 
I 
I 


I 
I 
I 
I 
I 1I--'6~ 
10--22 
I 
I 
f" 


I 
I 


I 
I~ 
I 
I 
I 
I 


I 
I 
: 
: J.-'6=J 
I 
I 
I 


: 
~ 
: K ICONSTANTI : 
: 
: 


: 
~ 
PRODUCT1 
;X 
PRODUCT2;X 
PRODUCT3:X 
PRODUCT4~ 
: 
:x K ICONSTANTI:X 
:X 
SUM' 
:X 
SUM2 
:X 
SUM3 :x 
SUM4 
:x 


~" 
; 


I 
I 
I 
I 
I 
I 
I 
I 
I 
I,{ 


: 
: 
; 
; 
;:'4 


I 
I 
\' 
I 
, 
I 
I 
I 
:A' : 
I 
I 
I 
I 
,I 
I 
I 
I 
I 
I 
II 
I 


" 
~ 
:~ 
:~ 
~ 
:~>l-- 
!~ K ICONSTANTIX 
SUM' 
SUM2 
SUM3 
SUM4 
'5 


VALID 
STATUS 
VALID 
STATUS 
VALID 
STATUS 
VALID 
STATUS 
VAllO 
STATUS 


--t*'-'3 
--! I<- 4 
-->t 
,.... 4 
"'4 
100-4 
1"'-4 
~ 
•.... 4 
, 
, 


NOTES: 
Assume the following 
single precision operations. 
(K • OPOI + OP1 = PRODUCT1 + OP1 - 
SUM1 


(K • OP2) + OP3 = PRODUCT2 + OP3 - 
SUM2 
(K • OP41 + OP5 = PRODUCT3 + OP5 - 
SUM3 
(K • OP61 + OP7 = PRODUCT4 + OP7 - 
SUM4 


Assume 
valid 
control 
signals 
for 
FAST, 
HALT=1, 
PIPES2-0=010, 
RESET=1, 
RND1-0, 
SELST1-0=11, 
TP1-0=11. 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


I 


I 
" 
INST:UCTlON: 
FUNCll~,OI. 
RNDI\OI. 
FAST, 
: 
=x 
FIRST OPERANDS 
;K 
SECOND OPERANDS X 
I 
I 
. 
~~~~~~~- 


DATA(31,OI A AND B INPUTS: 
: 


I 
:: 
I 


I 
I 
I 
I 
It------ 
1 
__ 
14-- 
1 
__ 


OUT(31,OI. STATUSl18,Ol 


NOTE: 
Assume 
PIPES2-0~111, 
CONFIG1-0=01, 
ENRA~X, 
ENRB=X, 
SELMS/i:S~X, 
OEY~O, 


OEC =OES ~O, RESET ~ HALT = " 
TP1-0 ~ 11 


FIGURE 23, 
SINGLE-PRECISION 
INDEPENDENT 
ALU 
OPERATION, 
ALL 
REGISTERS 
DISABLED 
(PIPES2-PIPESO 
~ 
111, 
CLKMODE 
- 
Xl 


Load First Operands 
Begin First Operation 
! 


load Second Operands 
Begin Second Operation 
! 


-n'------Il'----- 
ClK 
: 
: 


SECOND 
INSTRUCTION 
I 
' 


~ 
16 ..•••..22~ 
t--- 16 ~ 
22 


INSTRUCTION: FUNCll0,Ol. 
RNDll,OI. 
FAST 


I 
: 


(--O-p-~I~~~~~D-S- 
----------, 
o~~~~~~s 
~ 


__ 17 ..•••.23"; 
~ 
17 •.......•• 
23 


DATAl31 ,01 A AND B INPUTS 
I 
I 
I 
I 
I 


~---R~-'~-G~-T--~ 


I. 
2.1 
l. 
2 
" 


QUTI31,Ol 
STATUSI18,Ol 


FIGURE 24, 
SINGLE-PRECISION 
INDEPENDENT 
ALU 
OPERATION, 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
110, 
CLKMODE 
- 
Xl 


TEXAS • 
INSTRUMENlS 


SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


Load First Operands 
Load Second Operands 
Begin First Operation 
Begin Second Operation 
! 
~ 
~ 
n 
_ 
I 
I 
eLK _-----9 
--------01 
I 
I 
I 
' 


FIRST 
INSTRUCTION 
I 
I 
I 
I 
I 
~ 
16 _22~ 
~ 
16 ~22 
I 
I 
I 
I 
I 
I 
INSTRUCTION: FUNC(10,OI. RND(1,OI. FAST 
I 
I 
I 
I 
I 
I 
I 
; 
I 
I 
I 
I 


{ 
Op~I::~OS 
~ 
o~~~~~~S 
~ 
I 
1 
r 
I 
I 
10-17 -'23~ 
~ 
17 ~23 


DATAl31,Ol A AND B INPUTS 
: 


~ 
FIRST RESULT 
-------------------- 
I 
I 
OUTl31,O) STATUSl1a,O) 
1f.--4-4j 


FIGURE 25, 
SINGLE-PRECISION 
INDEPENDENT 
ALU OPERATION, 
INPUT 
AND 
OUTPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 010, CLKMODE 
= XI 


Load Third 
Load Fourth 
Load Fifth 
Operands 
Operands 
Operands 
Load Second 
Operands 
Begin Third 
Begin Fourth 
Begin Fifth 
Load First 
Operation 
Operation 
Operation 
Operands 
Begin Second 
Operation 
Load Pipeline 
Load Pipeline 
Load Pipeline 
Begin First 
Operation 
Load Pipeline 
Load Output 
Load Output 
Load Output 
• 
• 
• 
• 
• 


NOTE: Assume PIPES2-0=OOO, CONFIG1-0=Ol, 
ENRA=l, 
ENRB=l, 
SELMS/LS~X, 
OEY=O, OEC=OES=O, 
RESET=HALT=l,TP1-0='1 


FIGURE 26, 
SINGLE-PRECISION 
INDEPENDENT 
ALU OPERATION, 
ALL REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 000, CLKMODE 
= XI 


TEXAS + 
INSTRUMENlS 


SN74ACT8847 
64·BIT FLOATING-POINT UNIT 


+ 
--.JI~ 
_ 
I 
II 


(FIRST 
'NS:TRUCTION 


I 
I 
M- 16 -.t 
, 
I 
INSTRUCTION: 
FUNCil0.01. 
RNDil.01. 
FAST 
I 
I 
I 
I 


( 
H~~~F * 
R~_~_~_~_F 
_ 
I 
I 
I 
14-- 17__ 
23 --tl 
I 
DATAi31.01 
A AND B INPUTS 
I 
I 
I 
I 
I 


~ 
:: 
HALF 
REST 
_______ 
I 
I 
FIRST 
I 
_F_'R_S_T 
_ 


I 
I 
I 
I 
M- 
1 -.t 
14- 5 .•• 


OUTi31.01 STATUSllB.OI 


NOTE: Assum~PES2·0~111. 
CLKMODE=O. 
CONFIG1·0=11. 
ENRA=X. 
ENRB=X. 
OEY~O. 


OEe ~OES =0. 
RESET~HALT = 1. TP1·0~ 11 


FIGURE 27. 
DOUBLE-PRECISION 
INDEPENDENT 
ALU OPERATION. 
ALL REGISTERS 
DISABLED 
IPIPES2-PIPESO 
- 
111. 
CLKMODE 
- 
0) 


TEXAS 
~ 
INSTRUMENlS 


SN74ACT8847 
64-81T FLOATING-POINT UNIT 


Load Rest 
of First 
Operands 


Load Rest 
of Second 
Operands 


Load Half 
of First 
Operands 
~ 


Begin First 
Operation 
~ 


Load Half 
of Second 
Operands 
~ 


Begin Second 
Operation 
~ 


I 
, 
""-16 -+l 
I 
I 
INSTRUCTION: 


: 
I 
I 


, 
II 
22 ~1t-16-+f 
I 
•• 


FUNC(10.01. RND(1.01. FAST 
. 
" 
, 
" 


SECOND INSTRUCTION 
THIRD INSTRUCTION 
, 
, 
, 
••.22-tl1f-16 
....•• 
, 
• 
I 
,II 
, 
'II 
' 
I 
II 
' 


II 
17 ...•••......•• 
,..... 17 
I 
23 
I 
I 


II 
~te 
•• It- 17 -+t 
I 23 
I 
I 
I 
I 
I 
I--I~~ 
I 
I 
I 
I 
I 
I 


REST 
HALF 
SECOND 
THIRD 


I: 
I 
I 
••....••. 5 
14- 2 -+t 


FIGURE 28. 
DOUBLE-PRECISION 
INDEPENDENT 
ALU 
OPERATION, 
INPUT 
REGISTERS 
ENABLED 


(PIPES2-PIPESO 
- 
110, 
CLKMODE 
- 
01 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
64-81T FLOATING-POINT 
UNIT 


load 
Rest 


load 
Rest 
of Second 
of First 
Operands 


Load HaH 
Operands 
Load Hall 
Begin Second 


of First 
Begin First 
of Second 
Operation 


Operands 
Operation 
Operands 
LObd Output 
~ 
L 
~ 
~ 
.J 
I 
I 
I 
I 
I 
ClK 
I 


••• 
11 
., 
I, 
I 
I 
I 


FIRST INSTRUCTION 
SECONDINSTRUCTION 


_I 


I 
I 
I 
II 
I 
--164 
22~k-16_ 


INSTRUCTION: FUNC(10.01. RND(1.01.FAST 
: 


I 
I 
I 


REST 


1ST OP$ 


I 
II 
I 
••..22-.1 ~16--.l 
I 
I 
I 
I 
I 
I 


I 
••.• 22 ..•• 
I 
1I 


I 
II 
I 
II 
.11 .1....17 ....--.l 
k- 


23 
23 


DATA131.01 A AND B INPUTS 


I 
II 
I 
II 
I 
II 
I 
I 


17 ........- 
~ 
23 --.l ~ 
23 -t1 
....-- 
23 ~ 


23 
17 
I 
17 I 
17 
I 
1 
I 
1_--- 
__ 


I 
I 
I 
I 
I 


FIGURE 29. 
DOUBLE-PRECISION 
INDEPENDENT 
ALU 
OPERATl0l". 
INPUT 
AND 
OUTPUT 


REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010. CLKMODE 
- 
1) 


TEXAS 
~ 
INSTRUMENTS 


SN74ACT8847 
64·BIT FLOATING-POINT UNIT 


Load Half 
of First 
Operands 
1 


load 
Rest 
of Second 
load 
Rest 
Operands 
of First 
load 
Half 


Operands 
of Second 
Begin Second 


Begin First 
Operands 
Operation 


Operation 
Load Pipeline 
load 
Output 


~ 
! 
! 


I 
I 
I 
II 
I 
to-16 -.l 
22 *--+l~16 
~ 
I 
I 


INSTRUCTION: FUNC(10,Ol.RND(1,01. FAST 
, 
I 
I 
I 


II 
22~ 
t+- 16 -.l 
I 
I 
I 
I 
I 
I 


I 


22~ 
I 
I 
I 
I 
I 
I 


I 
I 
11 
:1 
I 
II 
t+-17 
__ 
~ 
17~~ 
17~14- 


23 
23 
23 


DATA(31,0) A AND B INPUTS 


II 
I 
II 
17 --+14----+1 t+- 
17 ~ 
\4-- 17 
'I 23 
23 
I 
I 
I 
I 
I 
I 
I 
I 
1 


I 
I 
~ 
4 


FIGURE 30. 
DOUBLE-PRECISION 
INDEPENDENT 
ALU OPERATION. 
ALL REGISTERS ENABLED 
(PIPES2-PIPESO = 000. 
CLKMODE 
= 0) 


TEXAS 
~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


sample independent multiplier microinstructions' 


The following 
independent multiplier timing diagram examples show five register settings, ranging through 
fully pipelined. 
Examples for divide and square root are included in this section. 
X = don't 
care. 


II 
I, 
INST~CTION: 
FUNCI1~.OI. RND(\OI. 
FAST 
:: 
===x: 
FIRST OPERANDS 
'V 
SECOND OPERANDS X 
I 
I 
A\ 
. 
-------- 
DATA(31.0) 
A AND B INPUTS: 
I 


; 
:: 
I 


I 
I 
I 
I 
It-- 
1 __ 
1+----__+1 


OUT(31.01. STATUS(18.01 


FIGURE 31. 
SINGLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION, 
ALL 
REGISTERS 
DISABLED 
(PIPES2-PIPESO 
111, 
CLKMODE 
= XI 


Load First Operands 
Load Second Operands 
Begin First Operation 
Begin Second Operation 
1 
1 
----fI~--~n--- 
C~ 
I 
! 


FIGURE 32. 
SINGLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION, 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010, 
CLKMODE 
= XI 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


load 
First Operands 
Begin First Operation 
! 
--fI 
Il 
_ 


I 
I 
ClK 
It___---- 
9----~"_ 
I 
I 
I 
, 


FIRST 


INSTRUCTION 


I 
I 
I. 
I 
..- 16_22 ~ 
14- 16 
22 


I 
I 
I 
I 
I 
I 


INSTRUCTION: 
FUNC110.01. RND11.01. FAST 
1 
I 


I 
I 
. 
I 
I 


I 
; 
I 
1 
I 
1 


{ Op~I::~OS~ 
O;~~~~~S~ 


I 
I 
I 
I 
I 
.17 
~23~ 
14- 17 --*-23 


DATA(31.01 
A AND B INPUTS 
: 


~ 
FIRST RESULT 
------------------1 
I 


OUT(31.01 STATUS(1B.O) 
t--4--.l 


FIGURE 
33. 
SINGLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION. 
INPUT 
AND 
OUTPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010. 
CLKMODE 
- 
XI 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
54-BIT 
FLOATING-POINT 
UNIT 


load 
Third 
load 
Fourth 
load 
Fifth 
Operands 
Operands 
Operands 
load 
Second 
Operands 
Begin Third 
Begin Fourth 
Begin Fifth 
Operation 
Operation 
Operation 
Begin Second 
Operation 
load 
Pipeline 
load 
Pipeline 
load 
Pipeline 


load 
Pipeline 
load 
Output 
load 
Output 


* 
* 
• 


Begin First 
Operation 


* 


I 
I 
I 
..-4-+l 


FOURTH 
I 
I 
RESULT 


I 
I 
_ 
4--.t 


FIGURE 34. 
SINGLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION. 
ALL 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 000. 
CLKMODE 
= X) 


lo.d 
H.ll 


f_o_••_.•_nd_'_~~ 
~r'dP,p,hn, 


, 
I 
, 
, 


, 
I 


< 
FIRST INSTRUCTION 


I 
, 


~16 
.pI 
22 


INSTRUCTION 
FUNCllO.OL 
RNDll.Ol. 
FAST 


I 
i 


K 
l;TA~p; 
t--17--.1>- 
23 


DATAl31,Ol 
A AND BINPUTS 


, 
1.__ 


I 
I 
I 
______________ 
~I 
, 
I 
, 
~~--~-:~-:-~~ 
-OU-'-'3-'-.0-' 
-S,-.-'U-S-,,-.-.O-,-------- 
~ 
3 ----Ill 
~5-----i 


FIGURE 35. 
DOUBLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION. 
ALL 
REGISTERS 
DISABLED 
(PIPES2-PIPESO 
- 
111. 
CLKMODE 
= 0) 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
64·BIT FLOATING-POINT UNIT 


load 
Rest 
of First 
Operands 
load 
Half 
of First 
Begin 
First 
Operands 
Operation 
! 
! 


load 
Rest 
of Second 
Operands 
load 
Pipeline 
I 


load 
Half 
of Second 
Begin 
Second 
Operands 
Operation 
l 
! 


III 
I 
III 
------------------,--- 
, 
l 
I 
I 
_________________ 
~I 
I' 
1------ 
~ 
------------------~------ 
I 
I 
I 
I 
~ 
I+-tt 
3 
5 


:.-- 
16 I 
.: 
I.------*- 22 
I 
I 
, 
INSTRUCTl0':'l: 
FUN<f(10,OI. R~D(1,OI. FAST 


..- 17 ....:..:1..17":'- 
23 
-..: 
23 


DATA(31,OI A AND B INPUTS 


I 
• 
_ 
16--+f 
I 
III 


k-----.l- 22 
I, 
I 


" 
I 
I 
II 
I 
~I 
•• 
1. 
17 
23 17 


FIGURE 36. 
DOUBLE·PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION. 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
110. 
CLKMODE 
- 
1) 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
54·BIT FLOATING-POINT UNIT 


load 
Rest 
of First 
Operands 
load 
Half 
of Second 
Operands 


load 
Rest 
of Second 
Operands 


Begin Second 
Operation 
load 
Half 
of First 
Operands 
+ 


Begin First 
Operation 
+ 


load 
Pipeline 


+ 


load 
Output 
+ 


, 
, 
...---.r 16 
INSTRUCTION: 


, 
" 
~ 
22 -+I 1+-16 -.t 
FUNC(10.01. RND(1.01. FAST 
; 


, 
, 
I+- 22 -+t 
, 
, 
, 
, 
, 
, 
, 
I 
I 


HALF 
REST 


1ST OP$ 
3RD OPS 


1 
I 
II 
I 
II 
I 
II 
I 
I 
1+-17 __ 
23-.1..-17 
...•... 23-.4 I+- 17_23-+1 
••...17~23-: 


DATA(31.01 A AND B INPUTS 
:: 


, 
I 


" 
~ 


, 
I 


: 
I 


I 
, 
, 
SElMS/LS 
i 
: 
I 
: 
____________________ 
1 
I 
I 
• 
I 


~ 
-------------------~ 
OUT(31.01 STATUS(1a.01 
~4--.l 
:'-5-.l 


FIGURE 
37. 
DOUBLE· PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION, 
INPUT 
AND 
OUTPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 010, CLKMODE 
= 0) 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
64-BIT FLOATING-POINT UNIT 


Load Rest 
Load Half 
Load Rest 
of Second 
of Third 
of First 
Load Half 
Operands 
Operands 


Load Half 
Operands 
of Second 
Begin Second 


of First 
Begin First 
Operands 
Operation 
Load Pipeline 


Operands 
Operation 
Load Pipeline Load Pipeline 
Load Output 
~ 
~ 
~ 
~ 
~ 


I 
I 
I+- 16 -+t 
I 
INSTRUCTION: 


1 


I 
I 
I 
I 
22 _ 
~16'" 
I 
I 
FUNCf10.01. RND11.01.FAST 
I 
' 


I 
I 
22_ 
I 
II 
I 


I.-----..:- 16 
I 
II 
I 


I 
I 
I 
••..•22 -.t 


I 
I 
I 


I 


iii•••• -l.~--~- 
----~------ ------ 
23 
17 
23 
17 


DATA131.01 A AND B INPUTS 


I 
I 
I 
I 
~4-.4 


I 
I 
~5-.4 


FIGURE 38. 
DOUBLE-PRECISION 
INDEPENDENT 
MULTIPLIER 
OPERATION, 
ALL 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 000. 
CLKMODE 
= 01 


FIGURE 39. 
SINGLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
= 110. CLKMODE 
= Xl 


TEXAS + 
INSTRUMENTS 


SN74ACT8847 
54-BIT 
FLOATING-POINT 
UNIT 


FIGURE 40. 
SINGLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
= 100. 
CLKMODE 
Xl 


I 


I 
I 
INST~""'UN""~"'~""TE"'~"";P"OI~"'E'D""~_-- 


164! 
16~ 
I 
_22 
. **-22 
I 
I 


Y ~-~-N-D-E<-T~-R-M-I-::-ED-~ 
XLJdLX 
x 
I I 
4...•• 
~ 


FIGURE 41. 
SINGLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
- 
010. 
CLKMODE 
- 
Xl 


FIGURE 42. 
SINGLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
- 
000. 
CLKMODE 
- 
Xl 


TEXAS "I 
INSI'RUMENTS 


SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


I 
____ 
I 
I 
I 
INST-< 
OIVXX8&: ~iE~iM~~~ 
NEXT 
lOP)r 


I 
~22 
16 ~ 
-: 
1--22 


~16 
I 


y =msxxx-U~-NO-~-~-ER-;-;-Eg-O~ 
-I ~3 


NOTE: 
Assume 
PIPES2-0 = 110. CONFIG1-0 
~ 01. 
ENRA = I, ENRB ~ I, OEY = O. OEC = OES = 0, 


RESET = HALT = 
I, TP1-0 = 11 


FIGURE 43. 
DOUBLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
= 110. 
CLKMODE 
= 0) 


I 
I 


I 
I 
I 
INST-(--O-IV-~-U~~-~-E1~ER-~~~-Eg-§SX~1 
NEXT 
lOP))~I----- 


I 
I 
~22 
16~ 
-: 
~22 
~16 
I 


y ==>&M0~g~Ng~OE~1-E~~M~~N-~-O~----- 


-I ~3 


FIGURE 44. 
DOUBLE-PRECISION 
FLOATING 
POINT 
DIVISION 
(PIPES2-PIPESO 
= 100. 
CLKMODE 
- 
0) 


II 
I 
II 
I 
I 
INST-{--O-IV-~---~-~-O~-T-*-~-IN-~-~---~EXTlOP))1 


14-16 -.l i4-*- 22 
14-16~ 
1..1- 22 
I 


Y~~%o{T~~~~ 


4 --J-ti 


FIGURE 45. 
DOUBLE-PRECISION 
FLOATING 
POINT 
DIVISION 


(PIPES2-PIPESO 
- 
010. 
CLKMODE 
- 
1) 


TEXAS • 
INSTRUMENlS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


FIGURE 46. 
DOUBLE-PRECISION 
FLOATING 
POINT 
DIVISION, 
ALL 
REGISTERS 
ENABLED 


(PIPES2-PIPESO 
= 000, 
CLKMODE 
= 1I 


FIGURE 47. 
INTEGER 
DIVISION, 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 110, 
CLKMODE 
Xl 


FIGURE 48. 
INTEGER 
DIVISION, 
INPUT 
AND 
PIPELINE 
REGISTERS 
ENABLED 


(PIPES2-PIPESO 
- 
100, 
CLKMODE 
- 
XI 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
54·BIT 
FLOATING-POINT 
UNIT 


FIGURE 49. 
INTEGER 
DIVISION, 
INPUT 
AND 
OUTPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010, 
CLKMODE 
- 
Xl 


FIGURE 
50. 
INTEGER 
DIVISION, 
ALL 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
000. 
CLKMODE 
- 
Xl 


1 
I 
!SQUARE 
1 
1 
1 ROOT 
I 
I 
INST~U""'~""Dj""ETj"'ER"'~""'I~""Eg""~ 


16~ 
14-1 
I 
16 -.I 
1.--1 
~ 
te- 22 
I 
~ 
14- 22 
I 
I 


y ::=:X«>-Q<~~""~~D-~T~~-~-I~~E"'i~ 
-l 14- 3 


FIGURE 
51. 
SINGLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT. 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
110. 
CLKMODE 
- 
Xl 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


I 
I 
ISQUARE 
I 
I 
ROOT 
I 


INST~U"";""D~"'ET-iR-~~I~~E-g~---- 
16-.t 
te-I 
16-.1 
J..-I 
22 -.I Ie-- 
-.l It- 22 


I 
I 
Y=>0M"'~N""g"'Ei~E""iM"'~N""~"'D~ 
-I It- 3 


NOTE, 
Assume 
PIPES2-0 
~ 
110. 
CONFIG 1-0 = 01. 
ENRA 
~l ENRB 
~ 
1. SELMSfLS 
~ X. OEY = O. 


OEC 
~ OES = O. RESET = HALT 
= 
1. TP1-0 
~ 
11 


FIGURE 
52. 
SINGLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT. 
INPUT 
AND 
PIPELINE 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
100. 
CLKMODE 
- 
Xl 


FIGURE 
53. 
SINGLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT. 
INPUT 
AND 
OUTPUT 


REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010. 
CLKMODE 
- 
Xl 


TEXAS • 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


FIGURE 
54. 
SINGLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT, 
ALL 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 000, 
CLKMODE 
= XI 


FIGURE 55. 
DOUBLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT, 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 110, 
CLKMODE 
1I 


IIII 


INST i 
S~~~~E 
~~N~EiE~M:~~ 
I 
1~22 
~ 
1f-16 -.I 


I 
I 
, 
I 


NEXT lOP) >"",----- 


~16 
I 
I 
~ 
,.. 22 
I 


y~~NgE!E!MINro~ 
-l It- 3 


FIGURE 
56. 
DOUBLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT, 
INPUT 
AND 
PIPELINE 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
100, 
CLKMODE 
- 
0) 


TEXAS 
• 
INSTRUMENTS 


SN74ACT8847 
64·BIT 
FLOATING-POINT 
UNIT 


FIGURE 
57. 
DOUBLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT. 
INPUT 
AND 
OUTPUT 


REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
010. 
CLKMODE 
- 
11 


FIGURE 
58. 
DOUBLE-PRECISION 
FLOATING 
POINT 
SQUARE 
ROOT. 
ALL 
REGISTERS 
ENABLED 


(PIPES2-PIPESO 
- 
000. 
CLKMODE 
- 
0) 


NOTE: Assume 
PIPES2-0 = 110, 
CONFIG1-0 = 01, 
ENRA = 1, 
SELM/LS = X 
OEY = a 
OEC = OES= 0, RESET= HALT = 1 TP1-0 = 11. The result appears in the SREG. 
' 


FIGURE 59. 
INTEGER 
SQUARE 
ROOT. 
INPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
- 
110. 
CLKMODE 
- 
XI 


. 
TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT FLOATING-POINT UNIT 


ISQUARE 
I 
I 
ROOT 
I 
INST 
~~~~-~~U~~D-iT-~-R~-IN-iE-~----- 


16.... 
~I 
16-.I It-I 


..., 
t--22 
...., 
~22 
Y~U~Di+iR:*Eg~ 
-I ~3 


FIGURE 60. 
INTEGER 
SQUARE 
ROOT. 
INPUT 
AND 
PIPELINE 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 100. 
CLKMODE 
= XI 


FIGURE 61. 
INTEGER 
SQUARE 
ROOT. 
INPUT 
AND 
OUTPUT 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 010. 
CLKMODE 
= XI 


I SQUARE 
I 
ROOT 
I 
INST 
~-~-N~gE-i-Ei~M~~-N~~D-~------- 


16-.t 
~I 
16 -.l 
1.-1 


--t 
t-- 22 
--t 
~ 
22 
I 
Y~~~~Ng-Ei-TE~~M~~~N~~D~ 


-.l It- 4 


FIGURE 62. 
INTEGER 
SQUARE 
ROOT. 
ALL 
REGISTERS 
ENABLED 
(PIPES2-PIPESO 
= 000. 
CLKMODE 
= XI 


TEXAS ~ 
INSTRUMENTS 


SN74ACT8847 
64-BIT 
FLOATING-POINT 
UNIT 


sample chained mode microinstructions 


The following 
chained m'ode timing diagram examples show four register settings, 
ranging from fully 


flowthrough 
to fully pipelined. 


II 
I, 
INSTRUCTION: FUNC(10.Ol. RND(1.Ol. FAST 
: 
I 
I 
i 
=x 
FIRST OPERA,NDS 
)K---S-EC-O-N-D-O-P-E-:R-A-N-D-S-X~ 
_ 


DATA(31.0) 
A AND B INPUTS: 
i 


; 
:: 
I 


, 
I 
I 
I 
It--- 
1 _ 
I+------.t 


OUTl31.0). 
STATUS(1B.O) 


FIGURE 
63. 
SINGLE-PRECISION 
CHAINED 
MODE 
OPERATION, 
ALL 
REGISTERS 
DISABLED 
(PIPES2-PIPESO 
= 111, 
CLKMODE 
- 
XI 


load 
half 
Load Rest 
load 
Half 
load 
Rest 
load 
Half 
load 
Rest 


of First 
of First 
of Second 
of Second 
of Third 
of Third 
Operands 
Operands 
Operands 
Operands 
Operands 
Operands 
! 
• 
+ 
+ 
! 
! 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
I 
ClK 
I 
I 
I 
I 
I 
I 
I 
I 
I 
, 
I 
I 
I 
, 
, 


FIAST 
INSTRUCTION 
SECOND 
INSTRUCTION 
THIRD 
INSTRUCTION 


I 
, 


14-16_ 


I 
INSTRUCTION:, 


l-- 2H.ll...---..t- 
1 6 
, 
, 


FUNCll0.01. 
RNDll,Ol, 
FAST 


i 
I 


" 
~to-16-./ 
I 22 
I 
I 
I 
I 
I 


I 
II 
II 


14- 17 ~ 
~ 
17 ---.-.t 14- 


23 
:23 


DATA131.01 
A AND B INPUTS 
: 23 


I 


________ 
~ 
FIRST 
~ 
SECOND .x== 
I 
I -------, 
,------- 
........-.t2 
~2 


NOTE: 
Assume 
PIPES2-0 
~ 
110, CONFIG 1-0 
~ 11, ENRA ~ 1, ENRB = 1, SElMS/LS 
~ X, OEY ~ 0, 


oEC 
~ OES ~ 0, RESET ~ HALT 
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FIGURE 
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microprogramming 
the' ACT884 
7 


Because the' ACT884 7 is microprogrammable, 
it can be configured to operate on either integer or single- 


or double-precision 
data operands, 
and the operations 
of the registers, 
ALU, 
and multiplier 
can be 


programmed to support a variety of applications. 
The following 
sections present not only control settings 
but the timings 
of the specific 
operations 
required to execute the sample instructions. 


control inputs 


Control inputs to the' ACT8847 
are summarized 
in Table 15. 


The remaining inputs are discussed in the following 
sections. All control signals and their associated tables 


are also listed in the Reference Guide section to provide a complete, 
easy-to-access 
reference for the 


programmer 
already familiar with 
'ACT8847 
operation. 


SIGNAL 
HIGH 
LOW 


SYTEP 
Selects byte parity generation and test 
Selects single bit parity generation and test 


CLK 
Clocks all registers (except C) on rising edge 
No effect 


CLKC 
Clocks C register on rising edge 
No effect 


CLKMODE 
Enables temporary input register load on falling clock 
Enables temporary input register load on rising clock 


edge 
edge 


CONFIG1- 
See Table 4 (RA and RS register data source selects) 
See Table 40 (RA and RS register data source selects) 


CONFIGO 


ENRC 
No effect 
Enables C register load when CLKC goes high. 


ENRA 
If register is not in flowthrough,enables 
clocking of 
If register is not in flowthrough, 
holds contents of RA 


RA register. 
register. 


ENRS 
If register is not in flowthrough, 
enables clocking of 
II register is not in flowthrough, 
holds contents of RS 


RS register 
register 


FAST 
Places device in FAST mode 
Places device in IEEEmode 


FLOWC 
Causes output value to bypass C register and appear 
No effect 


on C register output bus. 


HALT 
No effect 
Stalls device operation but does not affect registers, 


internal 
states, 
or status. 
C register 
loading 
is not 


disabled 


OEC 
Disables 
compare 
pins 
Enables compare pins 


OES 
Disables status outputs 
Enables status outputs 


OEY 
Disables Y bus 
Enables Y bus 


PIPES2-PIPESO 
See Table 3 (Pipeline Mode Control) 
See Table 3 (Pipeline Mode Control) 


RESET 
No effect 
Clears 
internal 
states, 
status, 
internal 
pipeline 


registers, 
and 
exception 
disable 
register. 
Does 
not 


affect other data registers. 


RND1-RNDO 
See Table 16 (Rounding Mode Control) 
See Table 16 (Rounding Mode Control) 


SELOP7- 
See Tables 8 and 9 (Multiplier/ALU Operand 
See Tables 8 and 9 (Multipler/ALU Operand 


SELOPO 
Selection) 
Selection) 


SELMS/LS 
Selects MSH of 64-bit result for output on the Y bus 
Selects LSH of 64-bit result for output on the Y bus 


(no effect on single-precision operands) 
(no effect on single-precision operands) 


SELST1- 
See Table 14 (Status Output Selection) 
See Table 14 (Status Output Selection) 


SELSTO 


SRCC 
Selects multiplier result for input to C register 
Selects ALU result for input to C register 
TP1-TPO 
See Table 18 (Test Pin Control Inputs) 
See Table 18 (Test Pin Control Inputs) 
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rounding modes 


The' ACT884 7 supports the four IEEEstandard rounding modes: round to nearest, round towards 
zero 
(truncate), round towards infinity (round up). and round towards minus infinity (round down). The rounding 
function 
is selected by control 
pins RND1 and RNDO, as shown in Table 16. 


RND1- 
ROUNDING MODE SELECTED 
RNDO 
o 0 
Round towards nearest 
o 
1 
Round towards zero (truncateI 


1 0 
Round towards infinity (round upl 


1 
1 
Round towards negative infinity (round down) 


Rounding mode should be selected to minimize procedural errors which may otherwise 
accumulate 
and 
affect 
the accuracy of results. Rounding to nearest introduces 
a procedural error not exceeding half of 
the least significant 
bit for each rounding operation. Since rounding to nearest may involve rounding either 
upward 
or downward 
in successive steps, rounding errors tend to cancel each other. 


In contrast, directed rounding modes may introduce errors approaching one bit for each rounding operation. 
Since successive 
rounding operations 
in a procedure may all be similarly directed, 
each introducing 
up 
to a one-bit error, rounding errors may accumulate 
rapidly, especially in single-precision 
operations. 


The device can be programmed to operate in FAST mode by asserting the FAST pin. In the FAST mode, 
all denormalized 
inputs and outputs 
are forced to zero. 


Placing a zero on the FAST pin causes the chip to operate in IEEEmode. In this mode, the ALU can operate 
on denormalized 
inputs and return denormals. 
If a denorm is input to the multiplier, 
the DENIN flag will 
be asserted and the result will be invalid. Denormal numbers must be wrapped before being input to the 
multiplier. 
If the multiplier 
result underflows, 
a wrapped 
number will be output. 


handling of denormalized 
numbers (FAST) 


The FAST input selects the mode for handling denormalized 
inputs and outputs. 
When the FAST input 
is set low, the ALU accepts denormalized inputs but the multiplier generates an exception when a denormal 
is input. When FAST is set high, the DENIN status exception 
is disabled and all denormalized 
numbers, 
both inputs and results, are forced to zero. 


A denormalized input has the form of a floating-point 
number with a zero exponent, 
a nonzero mantissa, 
and a zero in the leftmost 
bit of the mantissa (hidden or implicit bit). A denormalized number results from 
decrementing 
the biased exponent field to zero before normalization 
is complete. 
Since a denormalized 
number cannot be input to the multiplier, 
it must first be converted 
to a wrapped 
number by the ALU. 
When the mantissa of the denormal is normalized by shifting 
it left, the exponent field decrements 
from 
all zeros (wraps past zero) to a negative two's 
complement 
number (except in the case of 0.1 XXX ... ), 
where the exponent 
is not decremented. 


Exponent underflow 
is possible during multiplication 
of small operands even when the operands are not 
wrapped numbers. Setting FAST = 0 selects gradual underflow 
so that denormal inputs can be wrapped 
and wrapped results are not automatically 
discarded. When FAST is set high, denormal inputs and wrapped 
results are forced to zero immediately. 
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When the multiplier is in IEEEmode and produces a wrapped number as its result, the result may be passed 
to the ALU and unwrapped. 
If the wrapped number can be unwrapped 
to an exact denormal, 
it can be 
output without 
causing the underflow 
status flag (UNDER) to be set. UNDER goes high when a result is 
an inexact denormal, 
and a zero is output 
from the FPU if the wrapped 
result is too small to represent 
as a denormal (smaller than the minimum denorm). Table 17 describes the handling of wrapped multiplier 
results and the status flags that are set when wrapped 
numbers are output 
from the multiplier. 


TYPE 
STATUS FLAGS SET 


OF RESULT 
DENORM 
NOTES 
INEX 
RNDCO 


Wrapped, 
1 
0 
0 
Unwrap with 'Wrapped 


exact 
exact' ALU instruction 


Wrapped, 


1 
1 
0 
Unwrap with 'Wrapped 


inexact 
inexact' 
ALU instruction 


Wrapped, 


Unwrap with 'Wrapped 


increased 
in 
1 
1 
1 
rounded' ALU instruction 
magnitude 


When operating in chained mode, the multiplier may output a wrapped result to the ALU during the same 
clock cycle that the multiplier status is output, 
In such a case, the ALU cannot unwrap the operand prior 
to using it, for example, when accumulating the results of previous multiplications, 
To avoid this situation, 


the FPU can be operated in FAST mode to simplify exception handling during chained operations. Otherwise, 
wrapped outputs from the multiplier may adversely affect the accuracy of the chained operation, because 
a wrapped number may appear to be a large normalized number instead of a very small denormalized number. 


Because of the latency associated with interpreting the FPU status outputs and determining how to process 
the wrapped output, 
it is necessary that a wrapped operand be stored external to the FPU (for example, 


in an external register file) and reloaded to the A port of the ALU for unwrapping 
and further processing. 


stalling the device 


Operation of the' ACT884 7 can be stalled nondestructively 
by means of the HALT signal. Bringing the 
HALT input low causes the device to inhibit the next rising clock edge. Register contents 
are unaltered 
when the device is stalled and normal operation 
resumes at the next low clock period after the HALT 
signal is set high. 


Stalling the device does not stall the C register. If ENRC is low, CLKC will clock in data from the source 
selected by SRCC. 


For some operations, 
such as a double-precision 
multiply 
with CLKMODE = 1, setting the HALT input 
low may interrupt 
loading of the RA, RB, and instruction 
registers, as well as stalling operation. 
In clock 
mode 1, the temporary register loads on the falling edge of the clock, but the HALT signal going low would 
prevent the RA, RB, and instruction 
registers from loading on the next rising clock edge. It is therefore 
necessary to have the instruction 
and data inputs on the pins when the HALT signal is set high again and 
normal operation 
resumes. 


The RESETinput is an active-low signal that asynchronously clears the internal states, status, and exception 
disable mask. Internal pipeline registers are cleared, but the RA, RB, and C registers are not. Operation 
resumes when RESET goes high again. 
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test pins 


Two pins, TP1-TPO, support system testing. 
These may be used, for example, to place all outputs 
in a 
high-impedance 
state, isolating the chip from the rest of the system 
(see Table 18). 


TP1- 
OPERATION 
TPO 


0 
0 
All outputs and II0s are forced low 


0 
1 
All outputs and II0s are forced high 


1 
0 
All outputs are placed in a high impedance state 


1 
1 
Normal operation 


independent 
ALU operations 


Configuration and operation of the' ACT884 7 can be selected to perform single- or double-precision floating- 
point and integer calculations 
in operating modes ranging from flowthrough 
to fully pipelined. Timing and 
sequences of operations 
are affected 
by settings 
of clock mode, data and status 
registers, 
input data 
configurations, 
and rounding mode, as well as the instruction 
inputs controlling the ALU and the multiplier. 


Three modes of operation 
can be selected 
with 
inputs 
110-10, including 
independent 
ALU operation, 
independent multiplier operation, or simultaneous 
(chained) operation of ALU and multiplier. 
Each of these 
operating 
modes is treated separately 
in the following 
sections. 


The ALU executes single- and double-precision 
operations which can be divided according to the number 
of operands involved, one or two. Tables 19 and 20 show independent ALU operations with one operand, 
along with the inputs 110-10 which select each operation. 
Conversions from one format 
to another are 
handled in this mode, with the exception of adjustments 
to precision during two-operand 
ALU operations. 


The wrapping 
and unwrapping 
of operands is also done in this mode. 


Most format conversions involve double-precision timing. Conversions between single- and double-precision 
floating-point 
format are treated as mixed-precision 
operations requiring two cycles to load the operands. 
A single-precision 
number is loaded in the upper half (MSH) of its input register. During integer to floating- 
point conversions, 
the integer input should be loaded into the upper half of the RA register. If converting 
from integer to double precision, then two 
cycles are required. 


Logical shifts can be performed on integer operands using the instructions 
shown in Table 20. The data 
operand to be shifted is input from any valid operand source and the number of bit positions the operand 
is to be shifted is input only from the DB bus. The shift number on the DB bus should be in positive 32-bit 
integer format, 
although only the lowest eight bits are used. The shift number cannot be selected from 
sources other than the RB register and the shift number must be loaded on the same cycle as the instruction. 
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TABLE 
19. 
INDEPENDENT 
ALU 
OPERATIONS, 
SINGLE 
FLOATING-POINT 
OPERAND 
(110 = 0,19 = 0,16 
=0,15 
= 1) 


CHAINED 
OPERAND 
PRECISION 
PRECISON 
OUTPUT 
OPERAND 
ABSOLUTE 
ALU 
OPERATION 
OPERATION 
FORMAT 
RA 
RB 
SOURCE 
TYPE 
VALUE 
A 


110 
19 
18 
17 
16 
15 
14 
13-10 
RESULT 
o = 
Not 
0= 
0= 
AISP) 
o ~ BISP) 
0= 
ALU 
1 = 
Single 
0= 
A 
0000 
Pass 
A operand 


Chained 
Floating- 
1 = 
A(DP) 
1 = 
BIDP) 
result 
Operand 
1 = 
IAI 
0001 
Pass 
- A operand 


point 
must 
equal 
0010 
2' s complement 
integer 


18 
to 
floating-point 


conversion 
t 


0011 
Floating-point 
to 
2' s 


complement 
integer 


conversion 
t 


0100 
Move 
A operand 
(pass 


without 
NaN 
detect 
or 


exception 
flags 
active) 


0101 
Pass 
B operand 


0110 
Floating-point 
to 
floating- 


point 
conversiont 


0111 
Floating-point 
to 
unsigned 


integer 
conversion 
t 


1000 
Wrap 
(denormal) 
input 


operand 


1010 
Unsigned 
integer 
to 


floating-point 
conversion 
t 


1100 
Unwrap 
exact 
number 


1101 
Unwrap 
inexact 
number 


1110 
Unwrap 
rounded 
input 


tThe 
precision 
of the integer 
to floating-point 
conversion 
is set by 18. If 18 = 1, the operation 
is timed 
like a double·precision 
operation, 


requiring 
clock 
edges 
to 
load. 


:f.:This converts 
single-precision 
floating-point 
to double-precision 
floating-point 
and vice versa. 
If the 18 pin is low 
to indicate 
a single-precision 


input, 
the result 
of the conversion 
will 
be double 
precision. 
If the 18 pin is high, 
indicating 
a double-precision 
input, 
the result 
of the conversion 


will 
be single 
precision. 
This 
operation 
is timed 
like a double-precision 
operation, 
requiring 
2 clock 
edges 
to load. 


TABLE 
20. 
INDEPENDENT 
ALU 
OPERATIONS, 
SINGLE 
INTEGER 
OPERAND 
(110 = 0,19 = 1,16 = 0,15 = 1) 


CHAINED 
OUTPUT 
OPERAND 


OPERATION 


OPERAND 
FORMAT 
IPRECISION 
ALU 
OPERATION 
SOURCE 
TYPE 


110 
19 
18 
17 
16 
15 
14-10 
RESULT 
o = 
Not 
1 = 
0 
0 
= 
SP 2's 
o = 
ALU 
1 
= 
Single 
00000 
Pass 
A operand 


Chained 
Integer 
complement 
result 
Operands 
00001 
Pass 
I - A) 
operand 


0 
1 = 
SP 
00010 
Negate 
A operand 
11's complement) 


unsigned 
00101 
Pass 
B operand 


integer 
01000 
Shift 
A operand 
left 
logical 
§ 


01001 
Shift 
A operand 
right 
logical 
§ 


01101 
Shift 
A operand 
right 
arithmetic 
§ 


§B operand 
is number 
of 
bit 
positions 
A is to be shifted 
ISee 
instruction 
description 
for 
"Independent 
ALU 
Operations".) 
The 
B operand 


must 
be input 
on the 
same 
cycle 
that 
shift 
is to 
be performed. 
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Tables 21 and 22 present independent ALU operations with two operands. When the operands are different 
in precision, 
one single and the other double, the settings 
of the precision 
selects 
18-17 will identify 
the 
single-precision 
operand so that it can automatically 
be reformatted 
to double-precision 
before the selected 
operation 
is executed, 
and the result of the operation 
will be double precision. 


Precision of each data operand is indicated 
by the setting 
of instruction 
input 
18 for single-operand 
ALU 


instructions, 
or the settings of 18-17 for two-operand 
instructions. 
For single-operand 
instructions, 
17 must 
be set equal to 18. When the ALU receives mixed-precision 
operands (one operand in single precision and 
the other in double precision), 
the single-precision 
data input is converted 
to double and the operation 
is 
executed 
in double precision. 
It is unnecessary 
to use the 'convert 
float-to-float' 
instruction 
to convert 
the single-precision 
operand prior to performing 
the desired operation 
on the mixed-precision 
operands. 
Setting 
18 and 17 properly 
achieves the same effect 
without 
wasting 
an instruction 
cycle. 


Timing for operations 
with mixed-precision 
operands is the same as for a corresponding 
double-precision 
operation. 
In a mixed-precision 
operation, 
the single-precision 
operand must be loaded into the upper half 
of its input register. 
If both operands are single precision, 
a single-precision 
result is output 
by the ALU. 
Operations 
on mixed-precision 
data inputs produce double-precision 
results. 
Mixed-precision 
operations 
do not apply to denormalized 
numbers. 


TABLE 21. INDEPENDENT ALU OPERATIONS, TWO FLOATING-POINT 
OPERANDS 
(110 = 0, 19 = 0,15 
= 0) 


CHAINED 
OPERAND 
PRECISION 
PRECISION 
OUTPUT 
OPERAND 
ABSOLUTE 
ABSOLUTE 
ABSOLUTE 
ALU OPERATION 
OPERATION 
FORMAT 
RA 
RB 
SOURCE 
TYPE 
VALUE A 
VALUE B 
VALUE Y 


110 
19 
18 
17 
16 
15 
14 
13 
12 
11-10 
RESULT 


a = Not 
0= 
0= 
AISP) 
a = BISP) 
0= 
ALU 
a = Two 
a = A 
0= 
B 
a = Y 
00 
A + B 


Chained 
Floating- 
1 = AIDP) 
1 = BIDP) 
result 
Operands 
1 = 
IAI 
1 ~ 
IBI 
1 = 
IVI 
01 
A-B 


point 
10 
Compare 
A. B 


11 
B - 
A 


TABLE 22. INDEPENDENT ALU OPERATIONS, TWO INTEGER OPERANDS 
(110 = 0, 19 - 
1, 16 = 0, 15 = 0) 


CHAINED 
OUTPUT 
OPERAND 


OPERATION 
OPERAND FORMAT/PRECISION 
ALU OPERATION 
SOURCE 
TYPE 


110 
19 
18 
17 
16 
15 
14-10 
RESULT 
a - 
Not 
1 - 
a 
a = SP 2's 
a = ALU 
a = Two 
00000 
A + B 


Chained 
Integer 
complement 
result 
Operands 
00001 
A - 
B 
a 
1 = SP 
00010 
Compare 
A, B 


unsigned 
00011 
B-A 


integer 
01000 
Logical AND lA, B) 


01001 
Logical AND lA, NOT B) 


01010 
Logical AND INOT A, B) 


01011 
Logical AND INOT A, NOT B) 


01100 
Logical OR lA, B) 


01101 
Logical XOR lA, B) 


Two additional independent ALU operations may also be coded, The first of these is for loading the exception 
detect 
mask register, 
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The exception 
detect 
mask register can be loaded with 
a mask to enable or disable selected 
status 


exceptions. 
Status bits for enabled exceptions are logically ORed, and when the result is true, the ED pin 


goes high. During chained operations, 
both multiplier 
and ALU results are ORed. During independent 


operation, 
the nonselected 
status results are forced to zero. 


If the FPU is reset (RESET = 0), the exception 
detect mask register is cleared. Table 23 describes the 


settings for the mask register load instruction 
and the status exceptions which can be enabled or disabled 


with the mask. 


INSTRUCTION 
RESULTS 
INPUTS 


110-17 = 0111 
Exception mask load instruction 


16 
o = Load ALU exception disable register 


1 = Load multiplier exception disable register 


151 
o = IVAL exception enabled 


1 = IVAL exception disabled 


14 
o = OVER exception enabled 


1 = OVER exception disabled 
. 


13 
o = UNDER exception enabled 


1 = UNDER exception disabled 


12 
o = INEX exception enabled 


1 = INEX exception disabled 


11 
o = DIVBYO exception enabled 


1 = DIVBYO exception disabled ~ 


10 
o = DENORM exception enabled 


1 = DENORM exception disabled 


t Disabling IVAL in multiplier exception mask register also disables DENIN 


exception 


~Only significant when 16 = 1 


The second additional independent ALU operation is the Nap (no operation). The table below shows the 
coding for the Nap instruction. This is treated as an integer operation and cannot be used to preload single- 
or double-precision 
floating-point 
operands. 


110-10 


01100000000 


Because Nap, 
in effect, 
just prevents loading of the P or S registers, these registers must be enabled 


(PIPES2= 0) for the Nap to work correctly. 


Timing of a Nap instruction 
is the same as any single-precision 
ALU operation, 
taking one clock cycle 


per pipeline stage that is enabled. For example, when the' ACT884 7 is fully pipelined (PIPES2-PIPESO=000), 
a NaP's effect 
(preventing the overwriting 
of the P and S registers) will be seen on the third cycle. To 


hold the results of an operation on the Y bus for an extra cycle, the Nap instruction 
is inserted directly 


after the instruction 
whose results are to be held. 


The Nap freezes the output 
register's 
contents 
until new results are to be loaded into these registers. 
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independent multiplier operations 


In this mode, the multiplier 
operates 
on two 
of five input 
sources 
which 
can be either single precision, 
double precision, 
or mixed. Multiplication, 
division, 
and square root may be coded as independent 
multiplier 


operations. 


Operand 
precision 
is selected 
by 18 and 17, as for ALU operations. 
The multiplier 
can multiply 
the A and 
B operands, 
either operand with the absolute 
value of the other, or the absolute 
values of both operands. 
The result can also be negated 
when it is output. 
Operations 
involving 
absolute 
value or negated 
results 


are valid only when floating-point 
format is selected. If both operands are single precision, 
a single-precision 


result 
is output. 
Operations 
on mixed-precision 
data inputs 
produce 
double-precision 
results. 


Floating-point 
operands may be normalized or wrapped 
numbers, as indicated by the settings for instruction 


inputs 
11-10. As show~ 
in Table 25, the multiplier 
can be set to operate 
on the absolute 
value of either 


or both floating-point 
operands and the result of any operation 
can be negated when it is output 
from the 


multiplier. 
Converting 
a single-precision 
denormal 
number to double precision 
does not normalize 
or wrap 


the denormal, so it is still an invalid input to the multiplier. 
Independent 
multiplier operations 
are summarized 


in Tables 
25 thru 
27. 


ABSOLUTE 


CHAINED 
OPERAND FORMAT/PRECISION 
OUTPUT 
MULTIPLY/ 
ABSOLUTE 
VALUE B/ 
NEGATE 


OPERATION 
SOURCE 
DIVIDE 
VALUE A 
DIV/SQRT 
RESULT 
WRAP A 
WRAP B 


110 
19 
IB 
17* 
16 
15 
14t 
13t 
12t 
I1t 
lot 


0= 
Not 
0= 
0= 
AISP) 
o = BISPI 
1 = 
0= 
0= 
A 
o = B 
o = y 
0= 
Normal 
0= 
Normal 


Chained 
floating- 
1 = AIDPI 
1 = BIDPI 
Multi- 
multiply 
1 = IAI 
1 = IBI 
1 = -y 
format 
format 


point 
plier 
1 = A is a 
1 = B is a 


result 
1 = 
O=A 
0= 
Div 
wrapped 
wrapped 


1 = 
0 
o = SP 2's 
Div/SORT 
1 = IAI 
1 = SORT 
number 
number 


integer 
complement 


0 
1 = SP 


unsigned 


integer 


t See also Tables 11 and 12. Operations involving absolute values, negated results, or wrapped numbers are valid only when floating·point 


format is selected (19 = 0). 


* For square root operations. 17 must be equal to lB. 


TABLE 26. INDEPENDENT MULTIPLY 
OPERATIONS 
SELECTED BY 14-12 (110 - 
0, 16 = 1, 15 = 0) 


ABSOLUTE 
ABSOLUTE 
NEGATE 


VALUE A 
VALUE B 
RESULT 
OPERATION SELECTED 


14 
13 
12 
14-12 
RESULTSt 


O=A 
0= 
B 
0= 
Y 
000 
A * B 


1 = IAI 
1 = IBI 
1 = -y 
001 
-(A * BI 


010 
A * IBI 


011 
-IA * IBIl 


100 
IAI * B 


101 
-IIAI 
* B) 


110 
IAI * IBI 


111 
-qAI 
* IBI) 


* Operations involving absolute values or negated results are valid only when floating-point 
format 
is selected 119= 0). 
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TABLE 27. INDEPENDENT DIVIDE/SQUARE 
ROOT OPERATIONS 
SELECTED BY 14-12 (110 = 0.16 
= 1.15 
= 1) 


ABSOLUTE 
ABSOLUTE 
NEGATE 


VALUE A 
VALUE B 
RESULT 
OPERATION SELECTED 


14 
13 
12 
14-12 
RESULTSt 


O=A 
o = Divide 
0= 
Y 
000 
A/B 


1 = A 
1 = SQRT 
1 = 
-Y 
001 
-IA 
/ BI 


010 
SQRT A 


011 
-(SQRT 
Al 


100 
IAI/ 
B 
101 
-qAII 
BI 


110 
SQRT IAI 


111 
-ISQRT 
IAII 


10perations 
involving absolute values or negated results are valid only when floating·point 
format 
is selected (19 = 01. 


chained multiplier/ALU 
operations 


In chained mode, the' ACT8847 
performs simultaneous operations in the multiplier and the ALU. Operations 
not only include addition, subtraction, 
and multiplication, 
but also several optional operations which increase 
the flexibility 
of the device (see Table 28). Division and square root operations 
are not available in chained 
mode. Format conversions, 
absolute values, and wrapping 
or unwrapping 
of denormal 
numbers are also 
not available. 


The B operand to the ALU can be set to zero so that the ALU passes the A operand unaltered. The B operand 
to the multiplier 
can be forced to the value 1 so that the A operand to the multiplier 
is passed unaltered. 


Since in chained mode there are four operands but only two bits (18and 17)to select the operand precision, 
care must be taken with 
mixed-precision 
operations. 
The A input to the ALU and to the multiplier 
must 
be of the same precision; just as the B input to the ALU and to the multiplier must be of the same precision. 


NEGATE 
NEGATE 
ALU 
CHAINED 
OPERAND FORMAT/PRECISION 
OUTPUT 
ADD 
MULTIPLY 
ALU 
MULTIPLIER 
OPERATIONS 
OPERATION 
SOURCE 
ZERO 
BY ONE 
RESULT 
RESULT 


110 
19 
18 
17 
16 
15 
14 
131 
121 
11-10 
RESULT 


1 = 
0= 
0= 
AISP) 
o = B(SP) 
o = 
o = 
0= 
o = 
0= 
00 
A+B 


Chained 
floating- 
1 = AIDPI 
1 = B(DP) 
ALU 
Normal 
Normal 
Normal 
Normal 
01 
A - 
B 


point 
result 
operation 
operation 
operation 
operation 
10 
2 - 
A 


1 = 
1 = 
1 = 
1 = 
11 
B - 
A 


1 = 
0 
o = SP 2's 
1 = 
Forces 
Forces 
Negate 
Negate 


integer 
complement 
Multi· 
B2 input 
B1 input 
ALU 
multiplier 


0 
1 = SP 
plier 
of ALU 
of multi· 
result 
result 


unsigned 
result 
to zero 
plier to 


integer 
one 


sample independent ALU microinstructions 


The following 
independent 
ALU timing 
diagram examples show four register settings, 
ranging from fully 
flowthrough 
to fully 
pipelined. 
X = don't 
care. 
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instruction 
timing 


Table 29 details the number of clock cycles required to complete an operation in different 
pipelined modes. 
For more detail, see the sample microinstructions 
shown 
in the previous 
section. 


Clock duration and output delay depend on the pipeline mode selected. See the note in the table and timing 
parameters 
listed at the beginning 
of this document. 


TABLE 29. NUMBER 
OF CLOCKS 
REQUIRED 
TO COMPLETE 
AN OPERATION 


PIPES2-0 - 
000 
PIPES2-0 - 
100 
PIPES2-0 - 
110 
PIPES2-0 - 
010 
OPERATION 
Itpd4) 
(tpd3) 
Itpd2) 
(tpd4) 


Single-Precision 


Floating-Point 


ALU Operation or Multiply:! 
3 
2 
1 
2 


Divide 
8 
7 
7 
8 


Square Root 
11 
10 
10 
11 


Double-Precision 


Floating-Point 


ALU Operation t 
4 
3 
2 
3 


Multiply:! 
5 
4 
3 
4 


Divide 
14 
13 
13 
14 


Square Root 
17 
16 
16 
17 


Integer 


ALU Operation or Multiply:! 
3 
2 
1 
2 


Divide 
16 
15 
15 
16 


Square Root 
20 
19 
19 
20 


Y output and status valid following this tpd delay after the designated number of clocks 
tlncludes every conversion involving double-precision lOP - 
SP or 
DP - 
Integer) 
:!Includes all chained mode operations 
X ~ invalid 


When using fast cycle times and double-precision 
operations, 
two cycles may be required to output 
and 
capture both halves of a double-precision 
result. To insure the result remains valid for two cycles, a NOP 


instruction 
may need to be inserted between the operations. Table 30 shows the number of NOPs necessary 
to insert into the instruction 
stream for fully 
pipelined operation 
(PIPES2-PIPESO = 000). 
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TABLE 
30. 
NOPs 
INSERTED 
TO GUARANTEE 
THAT 
DOUBLE-PRECISION 
RESULTS 
REMAIN 
VALID 
FOR TWO 
CLOCK 
CYCLES 
(PIPES2-PIPESO 
- 
0001 


FOLLOWED By 
# NOPs INSERTED 
# CYCLES RESULT 
1ST OPERATION 
2ND OPERATION 
BETWEEN OPERATIONS 
IS VALID 


DP - 
32 BIT 
DP - 
32 BIT 
0 
2 


32 BIT - 
DP 
0 
2 


32 BIT OP 
0 
1 


DP ALU 
0 
2 


DP Multiply 
0 
2 


DP Sqrt 
0 
2 


DP Divide 
0 
2 


32 BIT - 
DP 
DP - 
32 BIT 
0 
2 


32 BIT - 
DP 
0 
2 


32 BIT OP 
1 
2 


DP ALU 
0 
2 


DP Multiply 
0 
2 


DP Sqrt 
0 
2 


DP Divide 
0 
2 


32 BIT OP 
DP - 
32 BIT 
0 
2 


32' BIT - 
DP 
0 
2 


32 BIT OP 
0 
1 


DP ALU 
0 
2 


DP Multiply 
0 
2 


DP Sqrt 
0 
2 


DP Divide 
0 
2 


DP ALU 
DP - 
32 BIT 
0 
2 


32 BIT - 
DP 
0 
2 


32 BIT OP 
1 
2 


DP ALU 
0 
2 


DP Multiply 
0 
2 


DP Sqrt 
0 
2 


DP Divide 
0 
2 


DP Multiply 
DP - 
32 BIT 
1 
2 


32 BIT - 
DP 
1 
2 


32 BIT OP 
2t 
2 


DP ALU 
1 
2 


DP Multiply 
0 
2 


DP Sqrt 
1 
2 


DP Divide 
1 
2 


NOTE: 32-bit operation refers to a single-precision 
floating-point 
or integer ALU operation or 
multiply, 
except conversion to or from double-precision. 
This assumes the instruction 
following 
a double-precision divide may begin loading on the 12th clock cycle, following 
a double-precision 
square root on the 15th cycle. 


tThe device will not load a single-precision operation on the first clock edge following this operation, 


so any single-precision instruction may be used. A NOP is recommended. The second instruction 
must be a NOP. 
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PROGRAMMING 
INFORMA nON 


TABLE 30. NOPs INSERTED TO GUARANTEE THAT DOUBLE-PRECISION 
RESULTS REMAIN VALID FOR TWO CLOCK CYCLES 
(PIPES2-PIPESO = 000) 
(Concluded) 


FOLLOWED BY 


2ND OPERATION 


DP - 
32 BIT 


32 BIT - 
DP 


32 BIT OP 


DP ALU 


DP Multiply 


DP Sqrt 


DP Divide 


# NOPs INSERTED' 


BETWEEN OPERATIONS 


1 


1 


2t 


1 
a 
a 
a 


1 


1 
2t 


1 
a 
a 
a 


DP - 
32 BIT 


32 BIT - 
DP 


32 BIT OP 


DP ALU 


DP Multiply 


DP Sqrt 


DP Divide 


# CYCLES RESULT 


IS VALID 


2 


2 


2 


2 


2 


2 


2 


2 


2 


2 


2 


2 


2 


2 


NOTE: 32-bit operation refers to a single-precision 
floating-point 
or integer ALU operation or 
multiply, except conversion to or from double-precision. This assumes the instruction 
following a double-precision divide may begin loading on the 12th clock cycle. following 
a double'-precision square root on the 15th cycle. 
tThe device will not load a single-precision operation on the first clock edge following this operation. 
so any single-precision instruction may be used. A NOP is recommended. The second instruction 
must be a NOP. 


exception and status handling 


Exception 
and status flags for the' ACT8847 
were listed previously 
in Tables 12 and 13. 


Output 
exception 
signals are provided to indicate both the source and type of the exception. 
DENORM, 
INEX, OVER, UNDER, and RNDCO indicate the exception 
type, and CHEX and SRCEX indicate the source 


of an exception. 
SRCEX indicates the source of a result as selected 
by instruction 
bit 16, and SRCEX is 


active 
whenever 
a result is output, 
not only when an exception 
is being signalled. 
The chained-mode 


exception 
signal CHEX indicates that an exception 
has be generated by the source not selected for output 


by 16. The exception type signalled by CHEX cannot be read unless status select controls SELST1-SELSTO 
are used to force status 
output 
from the deselected 
source. 


Output 
exceptions 
may be due either to a result in an illegal format 
or to a procedural 
error. Results too 


large or too small to be represented 
in the selected 
precision 
are signalled by OVER and UNDER. When 
INF is high, the output 
is the IEEE representation 
of infinity. 
Any ALU output 
which 
has been increased 


in magnitude by rounding causes INEX to be set high. DENORM is set when the multiplier output is wrapped 
or the ALU output 
is denormalized. 
DENORM is also set high when an illegal operation 
on an integer is 


performed. 
Wrapped outputs 
from the multiplier 
may be inexact or increased in magnitude 
by rounding, 
which 
may cause the INEX and RNDCO status signals to be set high. A denormal output 
from the ALU 


(DENORM = 1) may also cause INEX to be set, in which 
case UNDER is also signalled. 


Ordinarily, 
SELST1-SELSTO are set high so that status selection 
defaults 
to the output 
source selected 


by instruction 
input 16. The ALU is selected as the output 
source when 16 is low, and the multiplier 
when 


16 is high. 
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When the device operates in chained mode, it may be necessary to read the status results not associated 
with the output 
source. As shown in Table 14, SELST1-SELSTO can be used to read the status of either 
the ALU or the multiplier 
regardless of the 16 setting. 


Status results are registered only when the output (P and S) registers are enabled (PIPES2 = 0). Otherwise, 
the status register is transparent. 
In either case, to read the status outputs, the output enables (DES, OEC, 


or both) must be low. 


Status flags are provided to signal both floating-point 
and integer results. Integer status is provided using 
AEQB for zero, NEG for sign, and OVER for overflow/carryout. 


Several status exceptions 
are generated by illegal data or instruction 
inputs to the FPU. Input exceptions 
may cause the following 
signals to be set high: IVAL, DIVBYO, DENIN, and STEX 1-STEXO. If the IVAL 
flag is set, either an invalid operation such as the square root of - IX I, has been requested or a NaN (Not 
a Number) has been input. When DENIN is set, a denormalized 
number has been input to the multiplier. 


DIVBYO is set when the divisor is zero. STEX 1-STEXO indicate which port (RA, RB, or both) is the source 
of the exception 
when either a denormal is input to the multiplier 
(DENIN = 1) or a NaN (lVAL = 1) is 
input to the multiplier 
or the ALU. 


NaN inputs are all treated as IEEE signalling 
NaNs, causing the IVAL flag to be set. When output 
from 
the FPU, the fraction 
field from a NaN is set high (all 1s) and the sign bit is 0, regardless of the original 
fraction 
and sign fields of the input NaN. 


When the' ACT884 7 outputs a NaN, it is always in the form of a signalling NaN along with the IVAL (Invalid) 
and appropriate 
STEX flag set high (except for the MOVE A instruction 
which passes any operand as is 
without 
setting 
exception 
flags). 


Certain operations 
involving 
floating-point 
zeros and infinities 
are invalid, causing the' ACT884 7 to set 
the IVAL flag and output 
a NaN. Operations 
involving 
zero and infinity 
are detailed below. 


When executing 
a floating-point 
to integer conversion 
instruction 
which returns an inexact 8000 
0000, 
the overflow 
flag is set. 


A floating-point 
zero is represented 
by an all zero exponent 
and fraction 
field. The sign bit may be 0 or 
1, to represent 
+0 
OR -0, 
respectively. 


Zero divided by zero is an invalid operation. 
The result is a NaN with the IVAL and DIVBYO flags set. Any 
other number divided by zero results in the appropriately 
signed infinity 
with the DIVBYO flag set. 


For operations 
with floating-point 
zeros: ± 0 multiplied 
by any number is the appropriately 
signed O. 


+0 
+ (-0) 
+0 
+0 
+ (+0) 
+0 


-0 
+ (-0) 
-0 
-0 
+ (+0) 
+0 
+0 
- 
(-0) 
+0 
+0 
- 
(+0) 
+0 
-0 
- 
(-0) 
+0 
-0 
(+0) 
-0 


"1-01 
-0 


Floating-p'oint 
infinity 
is represented 
by an all 1 exponent 
field with 
an all 0 fraction 
field. The sign bit 
determines 
positive 
or negative 
infinity 
(0 or 1 respectively). 


Infinity divided by infinity is an invalid operation, setting the IVAL flag and resulting in a NaN output. Division 
of infinity 
by any other number results in the appropriately 
signed infinity. 
Division of any number (except 
infinity 
or zero) by infinity 
results in an appropriately 
signed zero. Infinity 
divided by zero results in the 
appropriately 
signed infinity 
with the DIVBYO flag set. 
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For invalid operations 
with 
infinity 
listed below, 
the output 
is a signalling 
NaN with 
the IVAL flag set. 


± infinity 
multiplied 
by ± 0 
± infinity 
divided by ± 0 
+ infinity 
+ (- infinity) 


- infinity 
+ (+ infinity) 
+ infinity 
- 
(+ infinity) 


- infinity 
- 
(- infinity) 


Any other number added to or multiplied 
by infinity 
results in the appropriately 
signed infinity 
as output. 


support tools 


Texas Instruments 
has developed functional 
evaluation models of the' ACT884 7 in software 
which permit 
designers to simulate operation 
of the FPU. To evaluate the functions 
of an FPU, a designer can create 
a microprogram 
with sample data inputs, and the simulator will emulate FPU operation to produce sample 
data output files, as well as several diagnostic displays to show specific aspects of device operation. Sample 
microprogram 
sequences are included. 


design support 


Texas Instruments 
Regional Technology 
Centers, staffed with systems-oriented 
engineers, offer a training 
course to assist users of TI LSI products and their application to digital processor systems. Specific attention 
is given to the understanding 
and generation 
of design techniques 
which 
implement 
efficient 
algorithms 
designed to match 
high-performance 
hardware 
capabilities 
with 
desired performance 
levels. 


Information 
on VLSI devices and product support can be obtained from the following 
Regional Technology 
Centers: 


Atlanta 
Texas Instruments 
Incorporated 
3300 
N.E. Expressway, 
Building 8 
Atlanta, 
GA 30341 
404/662-7945 


Boston 
Texas Instruments 
Incorporated 
950 Winter 
Street, 
Suite 2800 
Waltham, 
MA 02154 
617/895-9100 


Northern 
California 
Texas Instruments 
Incorporated 
5353 
Betsy Ross Drive 
Santa Clara, CA 95054 
408/748-2220 


Chicago 
Texas Instruments 
Incorporated 
51 5 Algonquin 
Arlington 
Heights, 
IL 60005 
312/640-2909 


Dallas 
Texas Instruments 
Incorporated 
10001 
E. Campbell Road 
Richardson, 
TX 75081 
214/680-5066 


Southern 
California 
Texas Instruments 
Incorporated 
17891 
Cartwright 
Drive 
Irvine, CA 92714 
714/660-8140 


design expertise 


Texas Instruments 
can provide in-depth technical 
design assistance through 
consultations 
with contract 
design services. Contact your local Field Sales Engineer for current information 
or contact 
Datapath VLSI 
Products 
Systems 
Engineering at 214/997-3970. 
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instruction 
inputs 


Operations 
are summarized 
in Tables 31 thru 39. 


ALU OPERATION 
INSTRUCTION 
1II0TES 


ON A OPERAND 
INPUTS 110-10 


Pass A operand 
OOx x01x 0000 


Pass - A operand 
OOx x01 x 0001 


Convert from 2's 
OOx x010 0010 


complement 
integer 


to floating-point t 
x = Don't care 


Convert from floating- 
OOx x01x 0011 
18 selects precision of A 
point to 2' s complement 
operand 
integert 
0= 
A (SP) 
Move A operand (pass 
OOx x01x 0100 
1 = A (DP) 
without NaN detect or 


status flags active) 
17 selects precision of 8 


operand and must equal 18. 
Pass B operand 
OOx x01x 0101 
14 selects absolute value of 
Convert from floating- 
OOx x01x 0110 
a operand: 


point to floating-point 
0= 
A 
(adjusts precision of 


input: SP - 
DP. DP - 
SP)* 
1 = IAI 


Floating-point to 
During integer to f1oating- 


unsigned 
integer 
point conversion, IA I is not 


conversion 
t 
OOx x01x 0111 
allowed as a result. 


Wrap denormal operand 
OOx xO1x 1000 


Unsigned integer to 
OOx x01x 
1010 


floating-point 


conversion 
t 


Unwrap 
exact 
number 
OOx xO1x 1100 


Unwrap inexact number 
OOx x01x 1101 


Unwrap rounded input 
OOx x01x 1110 


t During 
this operation, 
18 selects 
the precision 
of the result. 
If the conversion 
involves 
doubleprecision, 
the operation requires 2 cycles to load. 


*Requires 2 cycles to load the operation, even if input is SP. 
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ALU OPERATIONS 
INSTRUCTION 
NOTES 
AND OPERANDS 
INPUTS 110-10 


Add A + B 
OOx xOOOOxOO 


Add IAI 
+ B 
OOx xOOl OxOO 
x = Don't Care 
Add A + IBI 
OOx xOOO1xOO 


Add IAI 
+ IBI 
OOx xOOl lxOO 
18 selects precision of A 


operand: 
Subtract A - 
B 
OOx xOOOOxOl 
0= 
A ISP) 
Subtract 
IA I - 
B 
OOx xOOl OxOl 
1 = A lOP) 
Subtract A - 
IBI 
OOx xOOOlxOl 
17 selects precision of B 
Subtract 
IAI 
- 
IBI 
OOx xOOl lxOl 
operand: 
Compare A, B 
OOx xOOOOxl0 
o = B (SP) 


Compare IAI 
' B 
OOx xOOl Oxl0 
1 = B lOP) 


Compare A, IBI 
OOx xOOOlxl0 
12 selects either V or its 


Compare IAI, 
IBI 
OOx xOOl lxl0 
absolute value: 


Subtract B - 
A 
OOx xOOOOx11 
O=V 


Subtract B-1 
A I 
OOx xOOl Oxll 
1 = IVI 


Subtract IBI - 
A 
OOx xOOOlxll 


Subtract IBI - 
IAI 
00xx0011xll 


ALU OPERATION 
INSTRUCTION 


ON A OPERAND 
INPUTS 110-10 
NOTES 


Pass A operand 
010 xOl0 0000 
x = Don't Care 


Pass -A 
operand (2's complement)* 
010 xOl0 0001 
17 selects format of A or B 


Negate A operand (1' s complement) 
010 xOl0 0010 
integer operand: 


Pass B operand 
010 xOl0 0101 
o = Single-precision 
2's 


Shift left logical1 
010 xOl0 
1000 
complement 


Shift right logical1 
010 xOl0 
1001 
1 = Single-precision unsigned 


integer 
Shift right arithmetic 1 
010xOl0ll0l 


1B operand is number of bit positions A is to be shifted and must be input on the same cycle as 
the instruction. 
*Pass (- A) of unsigned integer takes 1's complement. 
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REFERENCE GUIDE 


TABLE 
34. 
INDEPENDENT 
ALU 
OPERATIONS, 
TWO 
INTEGER 
OPERANDS 


ALU OPERATIONS 
INSTRUCTION 
NOTES 
AND OPERANDS 
INPUTS 110-10 


Add A + B 
010 
xOOO 0000 


Subtract A - 
B 
010 
xOOO 0001 
x = Don't Care 


Compare A, B 
010 
xOOO 0010 
17 selects format of A and B 


Subtract B - 
A 
010 
xOOO 001 1 
operands: 


Logical AND A, B 
010 
xOOO 1000 
o = Single-precision 2's 


Logical AND A, NOT B 
010 
xOOO 1001 
complement 


Logical AND NOT A, B 
010 
xOOO 1010 
1 = Single-precision unsigned 


Logical OR A, B 
010 
xOOO 1100 
integer 


Logical XOR A, B 
010 
xOOO 1101 


MULTIPLIER OPERATION 
INSTRUCTION 
NOTES 
AND OPERANDS 
INPUTS 110-10 


Multiply A * B 
OOx xl00 
OOxx 
x = Don't Care 


Multiply 
- (A * B) 
OOx xl00 
01xx 
18 selects A operand 


Multiply A * I BI 
OOx xl00 
10xx 
precision (0 = SP, 1 = DP) 


Multiply 
-(A * IBI) 
OOx xl00 
llxx 
17 selects B operand 
precision (0 = SP, 1 = DP) 
Multiply 
IAI * B 
OOx xl0l 
OOxx 
11 selects A operand format 
Multiply -liA I * B) 
OOx xl0l 
01xx 
(0 = Normal, 1 = Wrapped) 
Multiply 
IAI * IBI 
OOx xl 01 
10xx 
10 selects B operand format 
Multiply 
-(IAI 
* IBI) 
OOx xl0l 
llxx 
(0 = Normal, 1 = Wrapped) 


MULTIPLIER OPERATION 
INSTRUCTION 


AND OPERANDS' 
NOTES 
INPUTS '10-10 


x = Don't Care 


18 selects A operand precision 
Divide A / B 
OOx xll0 
Oxxx 
and 17 selects B operand 


SORT A 
00xxll0lxxx 
precision (0 = SP, 1 = DP) 


Divide IAI / B 
OOx xlll 
Oxxx 
12 negates multiplier result 
(0 = Normal, 1 = Negated) 
SORT IAI 
OOx xlll 
lxxx 
11 selects A operand format and 
10 selects B operand format 
(0 = Normal, 1 = Wrapped) 
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REFERENCE GUIDE 


TABLE 37. 
INDEPENDENT 
INTEGER 
MULTIPLY/DIVIDE/SQUARE 
ROOT 
OPERATIONS 


MULTIPLIER OPERATION 
INSTRUCTION 


AND OPERANDSt 
NOTES 
.INPUTS 110-10 


x = Don't care 
Multiply 
A * 
B 
010 
xl00 
0000 
17 selects 
operand 
format: 


Divide A I B 
010 
xll0 
0000 
o = SP 2's 
complement 


SORT A 
010 
xll0 
1000 
1 = SP unsigned 
integer 


t Operations 
involving 
absolute 
values, 
wrapped 
operands, 
or negated 
results 
are valid only when 
floating-point 
format 
is selected 
(19 = 01. 


CHAINED OPERATIONS 
OUTPUT 
INSTRUCTION 
NOTES 
MULTIPLIER 
ALU 
SOURCE 
INPUTS 110-10 


A * 
B 
A + B 
ALU 
lOx xOOO xxOO 


A*B 
A + B 
Multiplier 
lOx xl00 
xxOO 


A*B 
A - 
B 
ALU 
lOx xOOO xxO 1 


A * 
B 
A - 
B 
Multiplier 
lOx xl00 
xxOl 


A*B 
2 - 
A 
ALU 
lOx xOOO xxl0 
x = Don't 
Care 


A*B 
2 - 
A 
Multiplier 
lOx xl00 
xxl0 
\8 selects precision of 


A * 
B 
B - 
A 
ALU 
lOx xOOO xxll 
RA inputs: 


A * 
B 
B - 
A 
Multiplier 
lOx xl00 
xxll 
0= 
RA ISP) 


A * 
B 
A+O 
ALU 
lOx xOl 0 xxOO 
1 = RA lOP) 


A * 
B 
A+O 
Multiplier 
lOx xll0 
xxOO 
17 selects 
precision 
of 


A * 
B 
O-A 
ALU 
lOx xO 10 xx 11 
RB inputs: 


A * 
B 
0- 
A 
Multiplier 
lOx xll0 
xxll 
o = RB (SP) 


A * 
1 
A + B 
ALU 
lOx xOO 1 xxOO 
1 = RB IDPI 


A*l 
A+B 
Multiplier 
lOx xl0l 
xxOO 
13 negates 
ALU 
result: 


A * 
1 
A - 
B 
ALU 
lOx xOOl 
xxOl 
0= 
Normal 


A*l 
A - 
B 
Multiplier 
lOx xl0l 
xxOl 


A*l 
2 - 
A 
ALU 
lOx xOO1 xx 10 
1 = Negated 


A * 
1 
2 - 
A 
Multiplier 
lOx xl0l 
xxl0 
12 negates 
multiplier 


result: 
A * 
1 
B - 
A 
ALU 
lOx xOOl 
xxll 
o ~ Normal 


A * 
1 
B - 
A 
Multiplier 
10xxl0l 
xxll 
1 = Negated 


A*l 
A+O 
ALU 
lOx xOl1 
xxOO 


A * 
1 
A + 0 
Multiplier 
lOx xlll 
xxOO 


A * 
1 
O-A 
ALU 
lOx xOll 
xxll 


A * 
1 
o - 
A 
Multiplier 
10xxlll 
xxll 


tThe 
110-10 setting 
lxx 
xxlx 
xxl0 
is invalid. 
since 
it attempts 
to force 
the B operand 
of the ALU to both 0 and 
2 simultaneously. 
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CHAINED OPERATIONS 
OUTPUT 
INSTRUCTION 
NOTES 
MULTIPLIER 
AlU 
SOURCE 
INPUTS 110·10 


A * B 
A+B 
ALU 
110 xOOO0000 


A * B 
A + B 
Multiplier 
110 x100 0000 


A * B 
A - 
B 
ALU 
110 xOOO0001 


A * B 
A - 
B 
Multiplier 
110 x100 0001 


A * B 
2 - 
A 
ALU 
110 xOOO0010 


A * B 
2 - 
A 
Multiplier 
110 x100 0010 


A * B 
B - 
A 
ALU 
110 xOOO0011 


A * B 
B - 
A 
Multiplier 
110 xl 00 0011 


x = Don't Care 


A * B 
A+O 
ALU 
110 x010 0000 
17 selects format of A 


A * B 
A+O 
Multiplier 
110 x110 0000 
and B operands: 
A * B 
O-A 
ALU 
110 x010 0011 
o = SP 2's 
A * B 
O-A 
Multiplier 
110x1100011 
complement 


A * 1 
A + B 
ALU 
110 x001 0000 
1 = SP unsigned 


A * 1 
A + B 
Multiplier 
110 x101 0000 
integer 


A * 1 
A - 
B 
ALU 
110 x001 0001 


A * 1 
A - 
B 
Multiplier 
110 x101 0001 


A * 1 
2 - 
A 
ALU 
110 x001 0010 


A * 1 
2 - 
A 
Multiplier 
110x1010010 


A * 1 
B - 
A 
ALU 
110 x001 0011 


A * 1 
B - 
A 
Multiplier 
110x1010011 


A * 1 
A+O 
ALU 
110 x011 0000 


A * 1 
A + 0 
Multiplier 
110x111 
0000 


A * 1 
O-A 
ALU 
110x0110011 


A * 1 
O-A 
Multiplier 
110 x111 xx11 
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input configuration 


CONFIG1-CONFIGO 
control 
the order in which double-precision 
operands are loaded, as shown 
in Table 40. 


LOADING 
SEQUENCE 


DATA LOADED INTO TEMP 
DATA LOADED INTO RAJRB 


REGISTERON FIRST CLOCK 
REGISTERS ON SECOND 
AND RAfRB REGISTERS ON 
CLOCK 
SECOND CLOCK t 


CONFIG1 
CONFIGO 
DA 
DB 
DA 
DB 


0 
0 
B operand 
B operand 
A operand 
A operand 


(MSHI 
(LSH) 
(MSH) 
(LSH) 


0 
1t 
A operand 
B operand 
A operand 
B operand 


(LSHI 
(LSH) 
(MSHI 
(MSHI 


1 
0 
A operand 
B operand 
A operand 
B operand 


(MSH) 
(MSH) 
(LSHI 
(LSH) 


1 
1 
A operand 
A operand 
B operand 
B operand 


(MSH) 
(LSHI 
(MSH) 
(LSH)) 


tOn the first active clock edge (see CLKMODEI. data in this column is loaded into the temporary 


register. 
On the next rising edge. operands 
in the temporary 
register 
and the 
OAf DB buses 
are loaded into the RA and RB registers. 
tUse 
CONFIG1-0 
= 01 as normal single-precision 
input configuration. 


operand source select 


Multiplier 
and ALU operands 
are selected 
by SELOP7-SELOPO 
as shown 
in Tables 
41 and 42. 


A1 (MUX1) 
INPUT 
B1 (MUX2) 
INPUT 


SELOP7 
SELOP6 
OPERAND SOURCEt 
SELOP5 
SELOP4 
OPERAND SOURCEt 


0 
0 
Reserved 
0 
0 
Reserved 


0 
1 
C register 
0 
1 
C register 


1 
0 
ALU feedback 
1 
0 
Multiplier 
feedback 


1 
1 
RA input register 
1 
1 
RB input register 


A2 IMUX31 INPUT 
B2 (MUX4) 
INPUT 


SELOP3 
SELOP2 
OPERAND SOURCEt 
SELOP1 
SELOPO 
OPERAND SOURCEt 


0 
0 
Reserved 
0 
0 
Reserved 


0 
1 
C register 
0 
1 
C register 


1 
0 
Multiplier 
feedback 
1 
0 
ALU feedback 


1 
1 
RA input 
register 
1 
1 
RB input 
register 
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pipeline control 


Pipelining 
levels are turned 
on by PIPES2-PIPESO as shown 
in Table 43. 


PIPES2- 
ROUNDING MODE SELECTED 
PIPESO 


X 
X 
0 
Enables input registers IRA, RB) 


X 
X 
1 
Makes input registers (RA, RB) transparent 


X 
0 
X 
Enables pipeline registers 


X 
1 
X 
Makes pipeline registers transparent 


0 
X 
X 
Enables output registers (PREG, SREG, Status) 


1 
X 
X 
Makes output registers (PREG, SREG, Status) transparent 


round control 


RND1-RNDO select the rounding 
mode as shown 
in Table 44. 


RND1- 
ROUNDING MODE SELECTED 
RNDO 
0 
0 
Round towards nearest 
0 
1 
Round towards zero (truncate) 


1 
0 
Round towards infinity 
(round up) 


1 
1 
Round towards negative infinity (round down) 


status output selection 


SELST1-SELSTO choose the status 
output 
as shown 
in Table 45. 


SELSTl- 
STATUS SELECTED 
SELSTO 


0 
0 
Logical OR of ALU and multiplier exceptions (bit by bit) 
0 
1 
Selects multiplier status 


1 
0 
Selects ALU status 


1 
1 
Normal operation Iselection based on result source specified by 16 input) 


test pin control 


Testing 
is controlled 
by TP1-TPO as shown 
in Table 46. 


TP1- 


TPO 
OPERATION 


0 
0 
All outputs and II0s are forced low 
0 
1 
All outputs and II0s are forced high 


1 
0 
All outputs are placed in a high impedance state 


1 
1 
Normal operation 
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miscellaneous 
control 
inputs 


The remaining 
control 
inputs are shown 
in Table 47. 


SIGNAL 
HIGH 
LOW 


BYTEP 
Selects byte parity generation and test 
Selects single bit parity generation and test 


CLKMODE 
Enables temporary input register load on falling clock 
Enables temporary input register load on rising clock edge 


edge 


ENRC 
No effect 
Enables C register load when CLKC goes high 


ENRA 
If register is not in flowthrough, 
enables clocking of RB 
If register is not in flowthrough, 
holds contents of RA 


register. 
register: 


ENRB 
If register is not in flowthrough, 
enables clocking of RB 
If register is not in flowthrough, 
holds contents of RB 


register. 
register. 


FAST 
Places device in FAST mode 
Places device in IEEEmode 


FLOWC 
Causes output value to bypass C register and appear on 
No effect 


C register output bus. 


HALT 
No effect 
Stalls device operation but does not affect registers, 


internal states, 
or status 
me 
Disables compare pins 
Enables compare pins 


OES 
Disables status outputs 
Enables status outputs 


OEY 
Disables Y bus 
Enables Y bus 


RESET 
No effect 
Clears internal states. status, internal pipeline registers, 


and exception disable register. Does not affect other data 


registers. 


SELMS/LS 
Selects MSH of 54-bit result for output on the Y bus (no 
Selects LSH of 54-bit result for output on the Y bus (no 


effect on single-precision operands) 
effect on single-precision operands) 


SRCC 
Selects multiplier result for input to C register 
Selects ALU result for input to C register 


biased 
exponent 
- 
The true exponent of a floating-point 
number plus a constant 
called the exponent field's 
excess, In IEEEdata format, the excess or bias is 127 for single-precision numbers and 1023 for double-precision 
numbers. 


denormalized 
number 
(denorm) 
- 
A number with an exponent 
equal to zero and a nonzero fraction 
field, 
with the implicit 
leading (leftmost) 
bit of the fraction 
field being O. 


NaN (not 
a number) 
- 
Data that has no mathematical 
value. The 'ACT8847 
produces a NaN whenever 
an 
invalid operation 
such as 0 * 
00 is executed. 
The output 
format 
for an NaN is an exponent 
field of all ones, 
a fraction 
field of all ones, and a zero sign bit, Any number with an exponent ot' all ones and a nonzero fraction 
is treated 
as a NaN on the input, 


normalized 
number 
- 
A number in which the exponent 
field is between 
1 and 254 (single-precision) 
or 1 
and 2046 
(double-precision). 
The implicit 
leading bit is 1. 


wrapped 
number 
- 
A number created by normalizing a denormalized number's fraction 
field and subtracting 
from 
the exponent 
the number of shift 
positions 
required to do so. The exponent 
is encoded 
as a two's 
complement 
negative 
number. 
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SN74ACT8847 
Applications 


Performing fully pipelined double-precision operations requires a detailed understanding 
of timing constraints imposed by the multiplier. In particular, sum of products and 
product of sums operations can be executed very quickly, mostly in chained mode, 
assuming that timing relationships between the ALU and the multiplier are coded 
properly. 


Pseudocode tables for these sequences are provided (Tables 1 and 2) showing how 
data and instructions are input in relation to the system clock. The overall patterns 
of calculations for an extended sum of products and an extended product of sumS 
are presented. Theseexamples assumeFPUoperation in CLKMODE0, with the CONFIG 
setting 
10 
to 
load 
operands 
by 
MSH 
and 
LSH, 
all 
registers 
enabled 
(PIPES2- PIPESO= 000), and the C register clock tied to the system clock. 


In the sum of products timing table, the two 
initial products 
are generated in 
independent multiplier mode. Severaltiming relationships should be noted in the table. 
The first chained instruction loads and begins to execute following the sixth rising 
edge of the clock, after the first product P1 has already been held in the P register 
for one clock. For this reason, P1 is loaded into the C register so that P1 will be stable 
for two clocks. 


On the seventh clock, the ALU pipeline register loads with an unwanted sum, P1 + P1. 
However, because the ALU timing is constrained by the multiplier, the S register will 
not load until the rising edge of CLK9, when the ALU pipe contains the desired sum, 
P1 + P2. The remaining sequence of chained operations then execute in the desired 
manner. 


Table 1. Pseudocode 
for Fully Pipelined Double-Precision 
Sum of Products t 
(CLKMODE-O, 
CONFIG1-CONFIGO-10, 
PIPES2-PIPESO-OOO) 


DA 
DB 
TEMP 
INS 
INS 
RA 
RB 
MUl 
P 
C 
AlU 
S 
y 
ClK 
REG 
PIPE 
REG 
BUS 
BUS 
BUS 
REG 
BUS 
REG 
REG 
REG 
PIPE 
REG 
I 
1 
A1 MSH 
B1 MSH A1,B1MSH 
A1 *B1 
I 
2 
A1 LSH 
B1 LSH 
A1,B1LSH 
A1 *B1 
A1 *B1 
A1 
B1 
I 
3 
A2 MSH 
B2 MSH A2,B2MSH 
A2 *B2 
A1*B1 
A1 
B1 
A1*B1 
I 
4 
A2 LSH 
B2 LSH 
A2,B2LSH 
A2 *B2 
A2 *B2 
A2 
B2 
A1 *B1 


PR+CR 
I 
5 
A3 MSH 
B3 MSH A3,B3MSH 
A2*B2 
A2 
B2 
A2*B2 
P1 
A3*B3 
I 
PR+CR 
PR+CR, 


A2*B2 
6 
A3 LSH 
B3 LSH 
A3,B3LSH 
A3 
B3 
P1 
P1 
A3*B3 
A3*B3 
I 
PR+SR 
PR+SR, 
A3*B3 
P1 
P1 +P1 
7 
A4 MSH 
B4 MSH A4,B4MSH 
A3 
B3 
P2 
A4*B4 
A3*B3 
I 
8 
PR+SR 
PR+SR, 
A3*B3 
P2 
P1 
P1 +P2 
A4 LSH 
B4 LSH 
A4,B4LSH 
A4 
B4 
A4*B4 
A4*B4 
I 
PR+SR 
PR+SR, 
A4*B4 
P2 
Sl +P2 
Sl 
9 
A5 MSH 
B5 MSH A5,B5MSH 
A4 
B4 
P3 
A5*B5 
A4*B4 
I10 
PR+SR 
PR+SR, 
A4*B4 
P3 
P2 
Sl +P3 
Sl 
A5 LSH 
B5 LSH 
A5,B5LSH 
A5 
B5 
A5 *B5 
A5*B5 
III 
A6 MSH 
B6 MSH A6,B6MSH 
PR+SR 
PR+SR, 
A5*B5 
P4 
P2 
xxx xx 
S2 
A6*B6 
A5*B5 
A5 
B5 
I12 
P4 
P2 
S2 


tpR = Product 
Register 
SR = Sum Register 
CR = Constant 
IC) Register 


Table 2. 
Pseudocode 
for Fully Pipelined Double-Precision 
Product 
of Sums t 
(CLKMODE •••0, CONFIG1-CONFIGO •••10, PIPES2-PIPESO •••0001 


DA 
DB 
TEMP 
INS 
INS 
RA 
RB 
MUL 
P 
C 
ALU 
S 
y 
CLK 
REG 
REG 
PIPE 
REG 
BUS 
BUS 
BUS 
REG 
BUS 
REG 
REG 
REG 
PIPE 
I 
1 
A1M5H 
B1M5H 
A1,B1M5H 
A 1 + B1 
I 
2 
A1L5H 
B1L5H 
A 1,B1 L5H 
A1+B1 
A1+B1 
A1 
B1 
I 
3 
A2M5H 
B2MSH 
A2,B2M5H 
A2+B2 
A1+B1 
A1 
B1 
A1 +B1 
I 
4 
A2L5H 
B2L5H 
A2,B2L5H 
A2+B2 
A2+B2 
A2 
B2 
A1+B1 
51 
I 
CR *5R 
ENRC =0 
A2+B2 
51 
5 
A3M5H 
B3M5H 
A3,B3M5H 
A2+B2 
A2 
B2 
A3+B3 
51 
I 
CR*5R 
CR*5R 
51 
A2+B2 
52 
6 
A3L5H 
B3L5H 
A3,B3L5H 
A3 
B3 
A3+B3 
A3+B3 
I 
CR*5R 
51 *52 
51 
A3+B3 
52 
7 
XXX 
XXX 
XXX 
NOP 
A3 
B3 
A3+B3 
I 
PR*5R 
ENRA=O 
ENRB=O 
xxx 
8 
A4M5H 
B4M5H 
A4,B4M5H 
NOP 
51 *52 
51 
A4+B4 
A3 
B3 
I 
PR*5R 
PR*5R 
53 
9 
A4L5H 
B4L5H 
A4,B4LSH 
A4 
B4 
XXX 
P1 
51 
XXX 
A4+B4 
A4+B4 
I10 
PR*5R 
51 
A4+B4 
53 
XXX 
XXX 
XXX 
NOP 
A4 
B4 
P1 *53 
P1 
A4+B4 
III 
PR*5R 
ENRA=O 
ENRB=O 
P1 *53 
51 
A4+B4 
XXX 
A5M5H 
B5M5H 
A5,B5M5H 
NOP 
XXX 
A5+B5 
A4 
B4 


I12 
PR*5R 
PR*5R 
54 
A5L5H 
B5LSH 
A5,B5LSH 
A5 
B5 
XXX 
P2 
51 
X 
A5+B5 
A5+B5 


NOTE: 
NOP instruction 
is 011 
00000000. 


tpR = Product 
Register 


SR = Sum Register 
CR = Constant 
(C) Register 


Matrix Operations 


The' ACT884 7 floating-point unit can also be used to perform matrix manipulations 
involved in graphics processing or digital signal processing. The FPU multiplies and 
adds data elements, executing sequences of microprogrammed calculations to form 
new matrices. 


Representation of Variables 


In state representations of control systems, an n-th order linear differential equation 
with constant coefficients 
can be represented as a sequence of n first-order linear 
differential equations expressed in terms of state variables: 


dxl 
_ 
-- 
- 
x2,···, 
dt 
dX(n-1) 
dt 


For example, in vector-matrix 
form the equations of an nth-order system can be 
represented as follows: 


or, 
X = ax + bu 


Expanding the matrix equation for dne state variable, dx 1/dt, results in the following 
expression: 


Xl 
= (all * Xl + ... 
+ al n * xn) + (bll * ul 
+ ... 
+ bl n * un) 


where Xl 
= dx 1/dt. 


Sequences of multiplications 
and additions are required when such state space 
transformations are performed, and the' ACT884 7 has been designed to support such 
sum-of-products operations. An n x n matrix A multiplied by an n x n matrix X yields 
an n x n matrix C whose elements cij are given by this equation: 


n 


Cij = ~ 
aik * Xkj 
for i = 1, ... 
,n 
j = 1, ... 
,n 


k=l 


'For the Cij elements to be calculated by the' ACT884 7, the corresponding elements 
aik and Xkj must be stored outside the 'ACT884 7 and fed to the 'ACT884 7 in the 
proper order required to effect a matrix multiplication 
such as the state space system 
representation just discussed. 


Sample Matrix Transformation 


The matrix manipulations commonly performed in graphics systems can be regarded 
as geometrical transformations of graphic objects. A matrix operation on another matrix 
representing a graphic object may result in scaling, rotating, transforming, 
distorting, 
or generating a perspective view of the image. By performing a matrix operation on 
the position vectors which define the vertices of an image surface, the shape and 
position of the surface can be manipulated. 


The generalized 4 x 4 matrix 
for transforming 
a three-dimensional 
object 
with 
homogeneous ,coordinates is shown below: 


a 
b 
c 
d 
e 
f 
g 
h 
i 
j 
k 
I 
.. 
m 
n 
0 
p 


The matrix T can be partitioned into four component matrices, each of which produces 
a specific effect on the resultant image: 


3 
3 x 3 
x 
1 


The 3 x 3 matrix produces linear transformation 
in the form of scaling, shearing and 
rotation. The 1 x 3 row matrix produces translation, while the 3 x 
1 column matrix 
produces perspective transformation 
with multiple vanishing points. The final single 
element 1 x 1 produces overall scaling. Overall operation of the transformation matrix 
T on the position vectors of a graphic object produces a combination 
of shearing, 
rotation, 
reflection, 
translation, 
perspective, and overall scaling. 


The rotation of an object about an arbitrary axis in a three-dimensional space can be 
carried out by first translating the object such that the desired axis of rotation passes 
through the origin of the coordinate system, then rotating the object about the axis 


through the origin, and finally translating the rotated object such that the axis of rotation 
resumes its initial position. If the axis of rotation passesthrough the point P = [a b c 11, 
then the transformation 
matrix is representable in this form: 


[x y z 11 
1 
0 
o 
1 
o 
0 
-a 
-b 


o 
0 
o 
0 
1 
0 
-c 
1 


translation 
to origin 
rotation 
about 
origin 


translation 
back to initial 
position 


direction cosine for x-axis of 
rotation 


A general rotation using equation (2) is effected by determining the [x y zl coordinates 
of a point A to be rotated on the object, the direction cosines of the axis of rotation 
[n1, n2, n31, and the angle ¢ of rotation about the axis, all of which are needed to 


define matrix [R]. Suppose, for example, that a tetrahedron ABCD, represented by 
the coordinate matrix below is to be rotated about an axis of rotation RX which passes 
through a point P = [5 - 6 3 1] and whose direction cosines are given by unit vector 
[n1 = 0.866, 
n2 = 0.5, 
n3 = 0.707]. 
The angle of rotation 0 is 90 degrees (see 
Figure 1). The rotation matrix [R] becomes 


2 
-3 
3 1 
A 
1 -2 
2 1 
B 
2 -1 
2 1 
C 
2 
-2 
2 
1 
D 


0.750 
1.140 
0.112 
0 


R 
-0.274 
0.250 
1.220 
0 
1.112 
-0.513 
0.500 
0 
0 
0 
0 
1 


y 


r-- 
I 
BT 
(211 
AT 
I 
x· I 
IL _ ....• 
BR 
I 
I 
C' 
I 
(31 
L 
....• 
B' 
t- 


(1) 
THIS ARROW DEPICTS THE FIRST TRANSLATION 
(2) 
THIS ARDW 
DEPICTS THE 90° 
ROTATION 
(3) 
THIS ARROW DEPICTS THE BACK TRANSLATION 


The point transformation equation (2) can be expanded to include all the vertices of 
the tetrahedron as follows: 


xa 
ya 
za 
h1 


xb 
yb 
zb 
h2 
h3 
h4 


xc 
yc 


xd 
yd 


2-3 
3 1 
1 -2 
2 1 


2 -1 
2 1 


2-2 
2 1 


1 0 
00 
01 
00 
00 
1 0 


-56-31 


I 


0.750 
1.140 0.112 
0 
-0.274 
0.250 
1.22 
0 
1.112-0.5130.5000 
o 
0 
0 
1 


I 


1 
0 0 0 
o 
1 0 0 
o 
0 1 0 
5- 6 3 1 


translation 
to origin 
translation 
back to 
initial 
position 


The 'ACT884 7 Floating- Point Unit can perform 
matrix 
manipulation 
involving 


multiplications and additions such as those represented by equation (1). The matrix 
equation (3) can be solved by using the' ACT884 7 to compute, as a first step, the 
product matrix of the coordinate matrix and the first translation matrix of the right- 
hand side of equation (3) in that order. The second step involves postmultiplying the 
rotation matrix by the product matrix. The third step implements the back-translation 
by premultiplying the matrix result from the second step by the second translation 
matrix of equation (3). Details of the procedureto produce a three-dimensional rotation 
about an arbitrary axis are explained in the following steps: 


Translate the tetrahedron so that the axis of rotation passesthrough the origin. This 
process can be accomplished by multiplying the coordinate matrix by the translation 
matrix as follows: 


-3 
3 
-2 
2 


-1 
2 


-2 
2 


o 
0 
o 
0 
1 
0 
-3 
1 


1 
0 
o 
1 
o 
0 
-5 
6 


translation 
to origin 


-3 
+3 
0 
-4 
+4 
-1 
-3 
+5 
-1 
-3 
+4 
-1 


(2-5) 
(-3+6) 
(3-3) 
1 
(1 - 5) 
(-2+6) 
(2-3) 
1 
(2-5) 
(-1 
+6) 
(2-3) 
1 
(2-5) 
(-2+6) 
(2-3) 
1 


I 


vertices of translated 
tetrahedron 


The' ACT884 7 could compute the translated coordinates AT, BT, CT, DT as indicated 
above. However, an alternative method resulting in a more compact solution is 
presented below. 


Rotate the tetrahedron about the axis of rotation which passesthrough the origin after 
the translation of Step 1. To implement the rotation of the tetrahedron, postmultiply 
the rotation matrix [RI by the translated coordinate matrix from Step 1. The resultant 
matrix representsthe rotated coordinatesof the tetrahedron about the origin as follows: 


-3 
3 
0 
1 
-4 
4 
-1 
1 
-3 
5 
-1 
1 
-3 
4 
-1 
1 


0.750 
1.140 
0.112 
0 
-0.274 
0.250 
1.22 
0 
1.112 
-0.513 
0.500 
0 
0 
0 
0 
1 


I 


-3.072 
-2.670 
3.324 
1 
-5.208 
-3.047 
3.932 
1 
-4.732 
-1.657 
5.264 
1 
-4.458 
-1.907 
4.044 
1 


I 


Translate the rotated tetrahedron back to the original coordinate space. This is done 
by premultiplying the resultant matrix of Step 2 by the translation matrix. The following 
calculations produces the final coordinate matrix of the transformed object: 


- 3.072 
- 2.670 3.324 
1 
- 5.208 
- 3.047 3.932 
1 


- 4.732 
- 1.657 5.264 
1 
- 4.458 
- 1.907 4.044 
1 


1 
0 
0 
0 
o 
1 
0 
0 
o 
0 
1 
0 
5 -6 
3 
1 


1.928 
- 8.670 
6.324 
1 
-0.208 
-9.047 
6.932 
1 
0.268 
-7.657 
8.264 
1 
0.542 
-7.907 
7.044 
1 


A more compact solution to these transformation matrices is a product matrix that 
combines the two translation matrices and the rotation matrix in the order shown in 
equation (3). Equation (3) will then take the following form: 


xa ya za h1 
xb yb zb h2 
xc yc zc h3 
xd yd zd h4 


2 
-3 
3 
1 
0.750 
1.140 
0.112 
0 
1 
-2 
2 
1 
-0.274 
0.250 
1.220 
0 
2 
-1 
2 
1 
1.112 
-0.513 
0.500 
0 
2 
-2 
2 
1 
-3.730 
-8.661 
8.260 
1 


The newly transformed 
coordinates 
resulting from the postmultiplication 
of the 
transformation 
matrix by the coordinate matrix of the tetrahedron can be computed 
using equation (1) which was cited previously: 


n 


Cij =E aik * Xkj 
for i = 1, ... 
,n 
j = 1, ... 
,n 


k=1 


a11 * x11 + a12 * x21 + a13 * x31 + a14 * x41 
2 * 0.750 
+ (- 3) * (- 0.274) 
+ 3 * 1. 112 + 1 * (- 3.73) 
1.5 + 0.822 
+ 3.336 
- 
3.73 
1.928 


a11 * X12 + a12 * X22 + a13 * X32 + a14 * x42 
2 * 1.140 
+ (-3) * 0.250 
+ 3 * (-0.513) 
+ 1x(-8.661) 
2.28 
-0.75 
- 
1.539 
- 
8.661 
-8.67 


a11 * x13 + a12 * x23 + a13 * x33 + a14 * x43 
2 * 0.112 
+ (- 3) * 1.220 
+ 3 * 0.500 
+ 1 * 8.260 
0.224 
- 
3.66 
+ 1.5 + 8.260 
6.324 


h1 = c14 = 
a11 * x14 + a12 * x24 + a13 * x34 + a14 * X44 
2 * 0 + (-3) * 0 + 3 * 0 + 1 * 1 
0+0 
+ 0 + 1 
1 
A' = 
[1.928 
- 
8.67 6.324 
1] 


B' 
= [- 5.208 
- 3.047 3.932 
1] 
C' 
= [-4.732 
-1.657 
5.264 
1) 
0' 
= [- 4.458 
- 1.907 4.044 
1] 


Microinstructions for Sample Matrix Manipulation 


The 'ACT884 7 FPU can compute the coordinates for graphic objects over a broad 
dynamic range. Also, the homogeneous scalar factors h1, h2, h3 and h4 may be made 
unity due to the availability of large dynamic range. In the example presented below, 
some of the calculations pertaining to vertex A' are shown but the same approach 
can be applied to any number of points and any vector space. 


The calculations 
below 
show 
the 
sequence of 
operations 
for 
generating 
two 
coordinates, xa and ya, of the vertex A' after rotation. The same sequence could be 
continued to generate the remaining two coordinates for A' (za and h1I. The other 
vertices of the tetrahedron, B', C', and 0', can be calculated in a similar way. 


Table 3 presents a pseudocode description of the operations, clock cycles, and register 
contents for a single-precisionmatrix multiplication using the sum-of-products sequence 
presented in an earlier section. Registers used include the RA and RB input registers 
and the product (P) and sum (S) registers. 


CLOCK 
MULTIPLIER/ALU 
PSEUDOCODE 
CYCLE 
OPERATIONS 


1 
Load all, 
x 11 
all 
-+ RA, xll 
-+RB 
SP Multiply 
pl 
= all * xll 


2 
Load a12, x21 
a12 -+RA, x21 -+RB 
SP Multiply 
p2 = a12 * x21 
Pass P to S 
pl -+ P(pl) 


3 
Load a13, x31 
a13-+ 
RA, x31 -+RB 
SP Multiply 
p3 = a13 * x31, p2 -+P(p2) 
Add P to S 
P(p1) + 0 -+ SIp 1) 


4 
Load a14, x41 
a14 -+ RA, x41 -+ RB 
SP Multiply 
p4 = a14 * x41, p3 -+ P(p3) 
Add P to S 
P(p2) + S(pl) -+ S(pl 
+ p2) 


5 
Load all, 
xl 2 
all 
-+ RA, xl 2 -+ RB 
SP Multiply 
p5 = all * x12, p4-+P(p4) 
Add P to S 
P(p3) + S(pl 
+ p2) -+ S(pl 
+ p2 + p3) 


6 
Load a12, x22 
a12 -+ RA, x22 -+ RB 
SP Multiply 
p6 = a12 * x22, p5 -+ P(p5) 
Pass P to S 
P(p4) + S(pl 
+ p2 + p3) -+ 
Output S 
S(pl 
+ p2 + p3 + p4) 


7 
Load a13, x32 
a13 -+RA, x32-+ RB 
SP Multiply 
p7 = a13 * x32, p6 -+ P(p6) 
Add P to S 
P(p5) + 0 -+ S(p5) 


8 
Load a14, x42 
a14-+RA, x42 -+RB 
SP Multiply 
p8 = a14 * x42, p7 -+ P(p7) 


Add P to S 
P(p6) + S(p5)-+ S(p5 + p6) 


9 
Next operands 
A -+ RA, B -+ RB 
Next instruction 
pi = A * B, p8 -+ P(p8) 
Add P to S 
P(p7) + S(p5 + p6) -+ S(p5 + p6 + p7) 


10 
Next operands 
C -+ RA, D -+ RB 
Next instruction 
pj = C * D, pi -+ P(pi) 
Output S 
P(p8) + S(p5 + p6 + p7) -+ 
S(p5 + p6 + p7 + p8) 


S 
E 
C C C 
L 
LOOPP 
SS 
M 
S S 
K N N I I 
E E 
S 
BEE 
R 
M FF 
PP 
L L 
RR 
FEE 
S / 
YLLEH 
0 II 
E E 
00 
NN 
ANNR 
OOOTSSSATT 
I I 
DGGSS 
PP 
DO 
SRRCLEEEETTELPP 
10-0 
E 1-02-0 
7-0 
1-0 
TAB 
C S Y C S P 1-0 
T T 1-0 


000 0100 0000 
o 01 
0101111 
xxxx 
00 
o 1 
x x x x x x 
xx 
11 
100 01 10 0000 
o 01 
0101111 
xxxx 
00 
o 1 
x x x x x x 
xx 
11 
100 0000 0000 
o 01 
0101111 
1010 
00 
o 1 
x x x x x x 
xx 
11 
100 0000 0000 
o 01 
01011111010 
00 
o 1 
x x x x x x 
xx 
11 
100 0000 0000 
o 01 
01011111010 
00 
o 1 
x x x x x x 
xx 
11 


100 011 0 0000 
o 01 
0101111 
xxxx 
00 
0 
x x x x x x 
xx 
1 
11 
100 0000 0000 
o 01 
0101111 
1010 
00 
o 1 
x x x x x x 
xx 
1 
11 
100 0000 0000 
o 01 
0101111 
1010 
00 
o 1 
x x x x x x 
xx 
1 
11 
100 0000 0000 
o 01 
0101111 
1010 
00 
o 1 
x x x x x x 
xx 
1 
11 
100 011 0 0000 
o 01 
0101111 
xxxx 
00 
o 1 
x x x x x x 
xx 
1 
11 


Six cycles are required to complete calculation of xa, the first coordinate, and after· 
four more cycles the second coordinate ya is output. Eachsubsequent coordinate can 
be calculated in four cycles so the 4-tuple for vertex A' requires a total of 18 cycles 
to complete. 


Calculations for vertices B', C', and 0', can be executed in 48 cycles, 16 cycles for 
each vertex. Processing time improves when the transformation matrix is reduced, 
i.e., when the last column has the form shown below: 


The h-scalars h1, h2, h3, and h4 areequal to 1. The number of clock cycles to generate 
each 4-tuple can then be decreasedfrom 16 to 13 cycles. Total number of clock cycles 
to calculate all four vertices is reduced from 66 to 54 clocks. Figure 2 summarizes 
the overall matrix transformation. 


1° 
C 


1 
1 
I 
B 
I 
C' 


1 


B' 
0' 


lA' 
P (5, -6.3) 
I 
I 
I 
I 
Y' 


This microprogram can also be written to calculate sums of products with all pipeline 
registers enabled so that the FPUcan operate in its fastest mode. Because of timing 
relationships, the C register is used in some steps to hold the intermediate sum of 
products. Latency due to pipelining and chained data manipulation is 11 cycles for 
calculation of the first coordinate, and four cycles each for the other three coordinates. 


After calculation of the first vertex, 16 cycles are required to calculate the four 
coordinates of each subsequent vertex. Table 5 presents the sequenceof calculations 
for the first two coordinates, xA and yA. 


Products in Table 5 are numbered according to the clock cycle in which the operands 
and instruction were loaded into the RA, RB, and I register, and execution of the 
instruction began. Sums indicated in Table 5 are listed below: 


p1 + 0 
p1 + p3 
p2 + p4 
p5 + 0 


p5 + p7 
p6 + p8 
p9 + 0 
p9 + p11 


s9 = p10 + p12 
xA 
p1 + p2 +.p3 
+ p4 
yA = p5 + p6 + p7 + p8 


CLOCK 
I 
DA 
DB 
I 
RA 
RB 
MUL 
ALU 
P 
S 
C 
Y 
CYCLE 
BUS 
BUS 
BUS 
REG 
REG 
REG 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


0 
Mul 
x11 
a11 
1 
Mul 
x21 
a12 
Mul 
x11 
a11 
2 
Chn 
x31 
a13 
Mul 
x21 
a12 
p1 
3 
Mul 
x41 
a14 
Chn 
x31 
a13 
p2 
p1 
4 
Chn 
x12 
a11 
Mul 
x41 
a14 
p3 
s1 
p2 
5 
Chn 
x22 
a12 
Chn 
x12 
a11 
p4 
t 
p3 
s1 
p2 
6 
Chn 
x32 
a13 
Chn 
x22 
a12 
p5 
s2 
p4 
t 
p2 
7 
Chn 
x42 
a14 
Chn 
x32 
a13 
p6 
s3 
p5 
s2 
p2 
8 
Chn 
x13 
a11 
Chn 
x42 
a14 
p7 
s4 
p6 
s3 
s2 
9 
Chn 
x23 
a12 
Chn 
x13 
a11 
p8 
xA 
p7 
s4 
p6 
10 
Chn 
x33 
a13 
Chn 
x23 
a12 
p9 
s5 
p8 
xA 
p6 
xA 
11 
Chn 
x43 
a14 
Chn 
x33 
a13 
p10 
s6 
p9 
s5 
p6 
12 
Chn 
x14 
a11 
Chn 
x43 
a14 
p11 
s7 
p10 
s6 
s5 
13 
Chn 
x24 
a12 
Chn 
x14 
a11 
p12 
yA 
p11 
s7 
p10 
14 
Chn 
x34 
a13 
Chn 
x24 
a12 
p13 
s8 
p12 
yA 
p10 
yA 
15 
Chn 
x44 
a14 
Chn 
x34 
a13 
p14 
s9 
p13 
s8 
p10 


Chebyshev Routines for the SN74ACT8847 
FPU 


Introduction 


Using the 
SN74ACT8847, 
very 
efficient 
routines 
can 
be developed for 
the 
implementation of transcendental functions. A high degreeof accuracy can be achieved 
by taking advantage of the' ACT884 7' s ability to perform calculations using double- 
precision floating point operands. 


This application note describeshow to usethe' ACT884 7 to implement seven different 
transcendental functions. TIM (Texas Instruments Meta-Macro Assembler) assembly 
files have been written for all seven functions and these files are availableupon request 
from Texas Instruments. The algorithm chosen to implement these functions is the 
Chebyshev expansion method [1]. 
Table 6 lists the functions 
that 
have been 
implemented, along with the number of cycles required, and time required to perform 
the calculations. Also listed in the table is the cycle count and time required to perform 
the same calculation using the Motorola MC68881 Floating Point Coprocessor and 
the Intel 80387 Numeric Processor Extension. 


The Chebyshev expansion method was chosen rather than some of the more well 
known methods, such as the Taylor series and Newton-Raphson approximation, for 
a variety of reasons. The primary advantage of Chebyshev's method is that it provides 
a uniform convergence rate in the number of terms required to achieve the desired 
accuracy. Thus the range of the input value will have little effect on the accuracy of 
the result. Another advantage is that the number of terms required to calculate the 


approximation is relatively small. This provides for faster execution. Also, Chebyshev's 
method can be applied to any function which is continuous and of bounded variation. 
Lastly, tables are available which contain the constants necessary to implement 
Chebyshev's method. 


In order that this application note be useful to the largest audience, only those 
instructions and features common to all 'ACT884 7 versions have been used to 
implement the routines. 


Contact Texas Instruments VLSI Logic applications group at (214) 997-3970 for a 
copy of the seven TIM assembly files. 


CYCLE COUNTt 
EXECUTION SPEED* 
FUNCTION 
IN MICROSECONDS 
'ACT8847 
MC68881 
80387 
'ACT8847 
MC68881 
80387 


Sine 
51 
416 
122 to 
1.53 
25.0 
7.32 to 
771 
46.3 


Cosine 
51 
416 
123 to 
1.53 
25.0 
7.38 to 
772 
46.3 


Tangent 
84 
498 
191 to 
2.52 
29.9 
11.5 to 
497 
29.8 


ArcSine 
68 
606 
Not 
2.04 
36.4 
Not 
Avail. 
Avail. 


ArcCosine 
68 
650 
Not 
2.04 
39.0 
Not 
Avail. 
Avail. 


ArcTangent 
104 
428 
314 to 
3.12 
25.7 
18.8 to 
487 
29.2 


Exponentiation 
52 
522 
Not 
1.56 
31.3 
Not 
Avail. 
Avail. 


tFor MC68881 cycle count refer to 'MC68881 Floating Point Coprocessor User's Manual', Document No. 
MC68881 UM/AD, Page 6-13. For 80387 cycle count refer to '80387 Programmer's Reference Manual', 
Document No. 231917-001, 
Page E-36. 
+'ACT8847 cycle speed is 30 ns, 33 MHz 
MC68881 cycle speed is 60 ns, 16.6 MHz 
80387 cycle speed is 40 ns, 25 MHz 


If fIx) is continuous and of bounded variation over the interval - 1 ~ x ~ 1, then 
fIx) may be approximated by the following equation: 


00E arTr(x) 
r=O 


Note that the range for x is between - 1 and 1. For most functions, this restriction 
requires that the input, x, be range reduced before the calculation begins. Range 
reducing an argument means to scale the argument down to a certain range. In the 
case of Chebyshev approximations, the range is usually - 1 :5 x :5-1, or 0 :5 x :5 1. 


In the equation for fIx) above, the constants representedby an are known as Chebyshev 
coefficients. The variables represented by Tr are known as Chebyshev polynomials 
and can be derived from the following relationship and values: 


Tr+ 1(x) - 
2xTr(x) + Tr-1 
(x) = 0, 
TO(x) = 1, 
T1(x) = x 


To illustrate Chebyshev's expansion method, the procedure to approximate function 
fIx) using the first seven polynomials is now covered. Let 


fIx) = 1/2aO + 
a1T1(x) + 
a2T2(x) + 
a3T3(x) + 
a4T4(x) + 
a5T5(x) + 
a6T6(x) 


fIx) = 1/2aO + 
a1(x) + 
a2(2x2-1) 
+ 
a3(4x3 - 3x) + 
a4(8x4-8x2 
+ 1) + 
a5(16x5-20x3+5x) 
+ 
a6(32x6 - 48x4 + 18x2 - 1) 


fIx) = xO(1/2aO - 
a2 + a4 - 
a6) + 
x1(a1 - 
3a3 + 5a5) + 
x2(2a2 
8a4 + 18a6) + 
x3(4a3 
- 
20a5) + 
x4(8a4 
- 
48a6) 
+ 
x5(16a5) 
+ 
x6(32a6) 


Let cO = 1/2aO - 
a2 + a4 - 
a6 
c1 = a1 - 
3a3 + 5a5 
c2 = 2a2 - 
8a4 + 18a6 
c3 = 4a3 - 
20a5 
c4 = 8a4 - 
48a6 
c5 = 16a5 
c6 = 32a6 


Substituting the c's into the last equation for fIx), 


fIx) = CoxO + c1x 1 + C2x2 + C3x3 "+ 
C4x4 + C5x5 + c6x6 


Applying Horner's Rule yields, 


fIx) = (((((C6X + C5)x + C4)X + 
C3)x + C2)x + C1)x + cO 


In the remainder of the paper, the above equation will be referred to as Cseries. 
Therefore, 


Cseries_f(x) = (((((C6x + C5)x + C4)X + 
C3)x + C2)x + C1)x + cO 


The last step prior to approximating fIx) is to calculate the c's by substituting the values 
for the Chebyshev coefficients into the equations for cO through c6. 


Format for the Remainder of the Application Note 


Each of the seven functions will be covered in a separate section. Each section will 
include the following information: 


1. General steps required to perform the calculation including a description of 
any preprocessing and/or postprocessing 
2. An algorithm for each of the above steps 
3. What system intervention, if any, is required; this intervention may take the 
form of branching based on comparision status generated by the' ACT884 7, 
or storing and then later retrieving intermediate results 
4. The number of 'ACT884 7 cycles required to calculate fIx) 
5. A listing of the c's 
6. Pseudocode table showing 
how the calculation 
is accomplished. 
The 
pseudocode tables list the contents of all the reievent 'ACT884 7 registers 
and buses for each instruction. 
7. Microcode table listing the instructions 
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Cosine Routine Using Chebyshev's Method 


Steps Required to Perform the Calculation 


STEP 1 - 
Preprocessing; range-reduce the input, X, to a range of [ - 1,1). Next 
squarethis range-reducedvalue, multiply it by 2.0, and finally subtract 
1.0. X3 is the range-reducedinput value, it must be stored externally. 
'TRUNC' means to truncate. 


X1 .-- X*(2.0/pi) 
X2.-- 
(4(TRUNC(0.25(X1 
+ 2.0)))) - 
X1 + 1.0 
If X2 > 1.0 
Then X3 .-- 2.0 - 
X2 
Else X3 .-- X2 
X4'-- 
2.0*(X3*X3) 
- 
1.0 


STEP 2 - 
Core Calculation; X4 in Step 1 will be referred to as 'x' in the core 
calculation. 


X5.-- 
Cseries_cos 
.-- (((((((ca*x 
+ C7)*X + C6)*X + C5)*x + 
C4)*X + C3)*x + C2)*x + c1 )*x + cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times X3. 


T1 
-X*(2.0/pil 
T2 
-T1 
+ 2.0 


T3 
-0.25*T2 
and 


T4 
-1.0 
- 
CREG 


T5 
-INT(T3l 
T6 
-4*T5 
T7 
-DOUBLE(T6l 


T8 
- T7 + CREG 


CMP (1.0,T8l 
If (1.0 > T8l 
Then T9 - 
2.0 - 
CREG 


Else T9 - 
CREG 


T10 -CREG*CREG 
T11 -T10 
*2.0 
T12-T11 
- 
1.0 


T13 -c8*CREG 
T14-T13 
+ c7 
T15 -T14*CREG 
T16-T15+C6 
T17 -T16*CREG 
T18 -T17 
+ c5 
T19 -T18*CREG 
T20 -T19 
+ c4 
T21 -T20*CREG 
T22 -T21 
+ c3 
T23 -T22*CREG 
T24 -T23 
+ c2 
T25 -T24*CREG 
T26 -T25 
+ c1 
T27 -T26*CREG 
T28 -T27 
+ cO 


Step 3 perform the postprocessing: 


Cosine(Xl - 
T28*T9 


CREG- 
T1, T3 and T4 result 
from a chained instruction 
round controls set to truncate 
CREG- 
T4 


convert from integer to double 


CREG- 
T8 
T9 is X3 in Step 1, must 
be stored externally 
CREG-+ T9 


T12 is X4 in Step 1, the 
input to the core routine 


Required System Intervention 


As seen in the algorithm for Step 1, the' ACT884 7 performs a compare. The results 
of this compare determine which one of two calculations is to be performed. The 
system, in which the' ACT684 7 is a part, must make the decision as to which of the 
two calculations is to be performed. In addition, the system must store X3 and then 
later furnish X3 as an input to the' ACT884 7. 


Number of 'ACT884 7 Cycles Required to Calculate Cosine(x) 


Calculation of Cosine(x) requires 46 cycles. In addition, it is assumed that five additional 
cycles are required due to the compare instruction, and resulting system intervention. 
Therefore, the total number of cycles to perform the Cosine(x) calculation is 51. 


c8 = 3D19D46B7D4C8F32 
C7 = BD962909C5C01 ED6 
c6 = 3EOD53517735F927 
c5 = BE7CC930FDOADA9D 
c4 = 3EE3EOAF61F7677F 
c3 = BF41E5FDEF25C403 
c2 = 3F92A9FB40C 119ED 
c1 = BFD23B03366AAOC9 
cO = 3FF4464BCC8CBA 1F 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
ALU 
P 
C 
S 
y 


BUS 
BUS 
REG 
REG 
INSTR 
REG 
REG 
BUS 
COMMENT 
MODE 
PIPE 
PIPE 
REG 


1 
X MSH 
X lSH 
0 
RA2.RB2 
X is the input 


2DIVPI 
2DIVPI 
2DIVPI is a constant 
2 
MSH 
lSH 
X 
2DIVPI 
0 
RA2.RB2 
representing 
2.0/pi 


Preload RA with 
1.0 for 
3 
1.0 MSH 
1.0 LSH 
X 
2DIVPI 
0 
PR4+RB4 
RA2.RB2 
use in cycles 
5 and 11 


4 
2.0 MSH 
2.0 LSH 
1.0 
2.0 
0 
PR4+RB4 
Pl 


SR5.RB5 
5 
0.25 
MSH 
0.25 
LSH 
1.0 
0.25 
1 
RA5-CR5 
Pl 
Sl 


6 
1.0 
0.25 
0 
DP2I(PR7) 
SR5.RB5 
RA5-CR5 
Double precision 
- 
integer 


7 
1.0 
0.25 
0 
DP21(PR7) 
P2 
S2 
Cycles 6.7 set RND1, 0 = 01 


8 
4 
1.0 
4 
0 
SR8.RB8 
S2 
S3 


9 
1.0 
4 
1 
12DP(PR9) 
P3 
Integer - 
double-precision 


10 
1.0 
4 
1 
CR10+SR10 
S4 


COMPARE 
If SR 11 > RA 11 then 
13a 
11 
1.0 
4 
1 
RAll,SRll 
S5 
If SR 11 :s RA 11 then 
13b 


12 
1.0 
4 
0 
NOP 
S5 
Wait for system 
response 


13a 
2.0 MSH 
2.0 LSH 
1.0 
2.0 
1 
RB13-CR13 
Execute 
13a or 13b 


13b 
1.0 
4 
1 
PAS(CR13) 
Pass contents 
of CREG 


S6 is either RB13-CR13 
or 


14 
1.0 
2.0 
1 
CR14.CR14 
S6 
S6 
CR13 from 
PASS CR13, 
and 
or 4 
must be stored externally 


for use in cycle 43 


2.0 
Output 
S6 in cycles 
14 and 
15 
2.0 MSH 
2.0 LSH 
1.0 
or 4 
0 
RA16.PR16 
CR14.CR14 
S6 
S6 
15 


2.0 
16 
2.0 
or 4 
0 
RA16.PR16 
P4 


CLK 
DA 
DB 
RA 
RB 
CLK 
INSTR 
MUL 
ALU 
P 
C 
S 
y 
COMMENT 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


2.0 
17 
2.0 
or 4 
0 
PR18+RB18 
RA16.PR16 


18 
-1.0 
MSH 
-1.0 
LSH 
2.0 
-1.0 
0 
PR18+RB18 
P5 


19 
c8 MSH 
c8 LSH 
2.0 
c8 
1 
SR19.RB19 
S7 
Start core calculation 


20 
2.0 
c8 
0 
PR21 +RB21 
SR19.RB19 
S7 
S7 is input to core calc. 


21 
c7 MSH 
c7 LSH 
2.0 
c7 
0 
PR21 +RB21 
P6 


22 
2.0 
c7 
1 
SR22.CR22 
S8 


23 
2.0 
c7 
0 
PR24+RB24 
SR22.CR22 


24 
c6 MSH 
c6 LSH 
2.0 
c6 
0 
PR24+RB24 
P7 


25 
2.0 
c6 
1 
SR25.CR25 
S9 


26 
2.0 
c6 
0 
PR27+RB27 
SR25.CR25 


27 
c5 MSH 
c5 LSH 
2.0 
c5 
0 
PR27+RB27 
P8 


28 
2.0 
c5 
1 
SR28.CR28 
S10 


29 
2.0 
c5 
0 
PR30+RB30 
SR28.CR28 


30 
c4 MSH 
c4 LSH 
2.0 
c4 
0 
PR30+RB30 
P9 


31 
2.0 
c4 
1 
SR31.CR31 
S11 


32 
2.0 
c4 
0 
PR33+RB33 
SR31.CR31 


33 
c3 MSH 
c3 LSH 
2.0 
c3 
0 
PR33+RB33 
P10 


34 
2.0 
c3 
1 
SR34.CR34 
S12 


35 
2.0 
c3 
0 
PR36+RB36 
SR34.CR34 


36 
c2 MSH 
c2 LSH 
2.0 
c2 
0 
PR36+RB36 
P11 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
BUS 
INSTR 
REG 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
BUS 


37 
2.0 
c2 
1 
5R37.CR37 
513 


38 
2.0 
c2 
0 
PR39+RB39 
5R37.CR37 


39 
c1 M5H 
c1 l5H 
2.0 
c1 
0 
PR39+RB39 
P12 


40 
2.0 
c1 
1 
5R40.CR40 
514 


41 
2.0 
c1 
0 
PR42+RB42 
5R40.CR40 


42 
cO M5H 
Co L5H 
2.0 
cO 
0 
PR42+RB42 
P13 


43 
56 M5H 
56 L5H 
2.0 
56 
1 
5R43.RB43 
515 
Begin postprocessing 


Instruction is double- 


44 
2.0 
56 
0 
DUMMY 
5R43.RB43 
precision RA + RB. allows 


time for answer to 


propagate to the Y bus 


45 
2.0 
56 
0 
NOP 
P14 
P14 
Output M5H of answer 


46 
2.0 
56 
0 
NOP 
P14 
P14 
Output L5H of answer 


Microcode Table for the Cosine(x) Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be 1/2 pi. 


P 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 3FF921FB 
54442018 
F 
00_ 
2 
0 3 
FF 
1 
1 1 o 1CO 0000331 
000 
F 3FE45F30 
60C9C883 
F 
1 1 _ 
2 
0 3 
FF 
1 
1 1 o 1CO 0000331 
000 
F 3FFOOOOO00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 40000000 
00000000 
F 
1 1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 
F 3FOOOOOO00000000 
F01J2 
1 3 
BO 1 
1 0 
0 581 
001 
033 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 1A3 
1 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 1A3 
1 000 
3 3 
1 000 
F 00000000 
00000004 
F01...f"2 
o 1 
BF 1 
1 0 
0 240 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
FB 1 
1 1 o 1A2 
000 
0 3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 
1 3 
F6 
1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
FE 1 
1 1 o 182 
0000331 
000 
F 00000000 
00000000 
F 
0 0 S 
2 
0 3 
FF 
1 
1 0 
0 300 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
F7 
1 
1 1 o 1AO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
5F 
1 
1 1 o 1CO 0000331 
000 
F 40000000 
00000000 
FOO...f"2 
o 3 
EF 1 
1 0 
0 1CO o 000 
3 3 
1 000 
F 00000000 
00000000 
F10_2 
o 3 
EF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F BFFOOOOO00000000 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F 3D19046B 
704C8F32 
F 
0 
1 _ 
2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
0 0 S 
2 
0 3 
FB 1 
1 0 
0 180 
0000331 
000 


--.J 
F B0962909 
C5C01E06 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
ww 


-...I 
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Microcode Table for the Cosine(x) Calculation (Continued) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S a a a 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C T L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 003 
3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 o 0 003 
3 1 000 
F 3EOD5351 
7735F927 
F01_2 
o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F BE7CC930 
FDOADA9D 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 000 
3 3 1 000 
F 3EE3EOAF 
61F7677F 
F01_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 003 
3 1 000 
F BF41E5FD 
EF25C403 
F01_2 
o 3 
FB 
1 
1 1 o 180 o 0 003 
3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 003 
3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
0000331 
000 
F 3F92A9FB 
40C119ED 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 
000 
0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
000 
0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 000 
3 3 1 000 
F BFD23B03 
366AAOC9 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 000 
3 3 1 000 
F 3FF4464B 
CC8CBA 1F F 0 
1 _ 
2 o 3 
FB 
1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
F01_2 
1 3 
BF 
1 
1 1 o 1CO 
000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 000 
3 3 0 000 


All floating point inputs and outputs are double precision. The input is in radians. 


Steps Required to Perform the Calculation 


STEP 1 - 
Preprocessing; range reduce the input, X, to a range of [ - 1,1). Next 
square this range-reducedvalue, multiply it by 2.0, and finally subtract 
1.0. X3 is the range-reducedinput value, it must be stored externally. 
'TRUNC' means to truncate. 


X1 +- X*(2.0/pi) 
X2 +- X1 - 
(4(TRUNC(0.25(X1 
+ 1.0)))) 


If X2 > 1.0 


Then X3 +- 2.0 - 
X2 
Else X3 +- X2 
X4 +- 2.0*(X3*X3) 
- 
1.0 


STEP 2 - 
Core calculation; X4 in Step 1 will be referred to as 'x' in the core 
calculation. 


X5 +- Cseries_sin 


+- (((((((cS**x 
+ C7)*x + C6)*x + C5)*x + 
C4)*x + C3)*x + C2)*x + c1 )*x + cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times X3. 


Sine(X) +- X5*X3 


Algorithms for the Three Steps 


T1 
+-X*(2.0/pi) 
T2 
+-T1 + 1.0 
T3 
+-0.25*T2 
T4 
+-INT(T3) 
T5 
+-4*T4 


T6 
+-DOUBLE(T5) 


T7 
+-CREG - 
T6 


CMP (1.0,T7) 
If (1.0> 
T7) 


Then TS +- 2.0 - 
CREG 
Else TS +- CREG 


CREG +- T1 
round controls set to truncate 


compare 1.0 to T7 
CREG +- T7 
TS is X3 in Step 1, must 
be stored externally 
CREG -+ TS 


T9 
+-CREG*CREG 


T10 +-T9 *2.0 
T11+-T10-1.0 
T11 is X4 in Step 1 above, the input to 
the core routine 
T11 
= 'x' from Step 2 above 


T12 +-c8*CREG 
T13+-T12 
+ C7 
T14 +-T13*CREG 
T15 +-T14 + c6 
T16 +-T15*CREG 
T17+-T16 
+ c5 
T18 +-T17 *CREG 
T 19 +-T18 + c4 
T20 +-T19*CREG 
T21 +-T20 + c3 
T22 +-T21 *CREG 
T23 +-T22 + c2 
T24 +-T23*CREG 
T25 +-T24 + c1 
T26 +-T25*CREG 
T27 +-T26 +- cO 


Sine(X) +- T27*T8 


Required System Intervention 


As seen in the algorithm for Step 1, the 'ACT8847 
performs a compare. The results 
of this compare determine which one of two calculations is to be performed. The 
system, in which the' ACT884 7 is a part, must make the decision between which 
two calculations are to be performed. In addition, the system must store X3 and then 
later furnish X3 as an input to the' ACT884 7. 


Number of 'ACT8847 
Cycles Required to Calculate Sine(x) 


Calculation of Sine(x) requires 46 cycles. In addition, it is assumed that five additional 
cycles are required due to the compare instruction and resulting system intervention. 
Therefore, the total number of cycles to perform the Sine(x) calculation is 51. 


c8 = 3D 19D46B7D4C8F32 
C7 = BD962909C5C01 ED6 
c6 = 3EOD53517735F927 
c5 = BE7CC930FDOADA9D 
c4 = 3EE3EOAF61F7677F 
c3 = BF41E5FDEF25C403 
c2 = 3F92A9FB40C119ED 
c1 = BFD23B03366AAOC9 
cO = 3FF4464BCC8CBA 1F 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 


1 
X MSH 
X lSH 
0 
RA2.RB2 
X is the input 


2DIVPI 
2DIVPI 
X 
2DIVPI 
0 
RA2.RB2 
2DIVPI is a constant 
2 
MSH 
lSH 
representing 2.0/pi 


3 
X 
2DIVPI 
0 
PR4+RB4 
RA2.RB2 


4 
1.0 MSH 
1.0 lSH 
X 
1.0 
0 
PR4+RB4 
Pl 


5 
0.25 MSH 
0.25 lSH 
X 
0.25 
1 
SR5.RB5 
Pl 
Sl 


6 
1.0 MSH 
1.0 LSH 
X 
0.25 
0 
DP2I(PR7) 
SR5.RB5 
Double precision - 
integer 


7 
1.0 
0.25 
0 
DP21(PR7) 
P2 
Cycles 6,7 set RND1,0 = 01 


8 
4 
1.0 
4 
0 
SR8.RB8 
S2 


9 
1.0 
4 
1 
12DP(PR9) 
P3 
Inte.ger - 
double precision 


10 
1.0 
4 
1 
CR10-SR10 
S3 


1.0 
COMPARE 
If SRll 
- 
RA11 then 13a 
11 
4 
1 
RAll,SRll 
S4 
If SR11 ~ RA11 then 13b 


12 
1.0 
4 
0 
NOP 
S4 
Wait for system response 


13a 
2.0 MSH 
2.0 LSH 
1.0 
2.0 
1 
RB13-CR13 
Execute 13a or 13b 


13b 
1.0 
4 
1 
PAS(CR13) 
Pass contents of CREG 


S5 is either RB13 - CR13 or 


14 
1.0 
2.0 
1 
CR14.CR14 
S5 
S5 
CR13 from PASS CR13, and 
or 4 
must be stored externally 


for use in cycle 43 


2.0 MSH 
2.0 LSH 
1.0 
2.0 
0 
RA16.PR16 
CR14.CR14 
S5 
S5 
Output 55 in cycles 14 and 
15 
or 4 
15 


2.0 
P4 
16 
2.0 
0 
RA16.PR16 
or 4 


2.0 
0 
PR18+RB18 
RA16.PR16 
17 
2.0 
or 4 


18 
-1.0 
M5H 
-1.0 
LSH 
2.0 
-1.0 
0 
PR18+RB181 
I P5 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 
COMMENT 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


19 
c8 M5H 
c8 L5H 
2.0 
c8 
1 
5R19.RB19 
56 
5tartcore calculation 


20 
2.0 
c8 
0 
PR21 +RB21 
5R19.RB19 
56 
57 isinputto core calc. 


21 
c7 M5H 
c7 L5H 
2.0 
c7 
0 
PR21 +RB21 
P6 


22 
2.0 
c7 
1 
5R22.CR22 
57 


23 
2.0 
c7 
0 
PR24+RB24 
5R22.CR22 


24 
c6 M5H 
c6 L5H 
2.0 
c6 
0 
PR24+RB24 
P7 


25 
2.0 
c6 
1 
5R25.CR25 
58 


26 
2.0 
c6 
0 
PR27 +RB27 
5R25.CR25 


27 
c5 M5H 
c5 L5H 
2.0 
c5 
0 
PR27 + RB27 
P8 


28 
2.0 
c5 
1 
5R28.CR28 
59 


29 
2.0 
c5 
0 
PR30+RB30 
5R28.CR28 


30 
c4 M5H 
c4 L5H 
2.0 
c4 
0 
PR30+RB30 
P9 


31 
2.0 
c4 
1 
5R31.CR31 
510 


32 
2.0 
c4 
0 
PR33+RB33 
5R31.CR31 


33 
c3 M5H 
c3 L5H 
2.0 
c3 
0 
PR33+RB33 
P10 


34 
2.0 
c3 
1 
5R34.CR34 
511 


35 
2.0 
c3 
0 
PR36+RB36 
5R34.CR34 


36 
c2 M5H 
c2 L5H 
2.0 
c2 
0 
PR36+RB36 
P11 


37 
2.0 
c2 
1 
5R37.CR37 
512 


38 
2.0 
c2 
0 
PR39+RB39 
5R37.CR37 


39 
c1 M5H 
c1 L5H 
2.0 
c1 
0 
PR39+RB39 
P12 


40 
2.0 
c1 
1 
5R40.CR40 
513 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
ALU 
P 
C 
S 
Y 
COMMENT 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


41 
2.0 
c1 
0 
PR42+RB42 
5R40.CR40 


42 
Co M5H 
cO l5H 
2.0 
Co 
0 
PR42+RB42 
P13 


43 
55 M5H 
55 l5H 
2.0 
55 
1 
5R43.RB43 
514 
Begin postprocessing 


Instruction is double- 


44 
2.0 
55 
0 
DUMMY 
5R43.RB43 
precision RA + RB. allows 


time for answer to 


propagate to the Y bus 


45 
2.0 
55 
0 
NOP 
P14 
P14 
Output M5H of answer 


46 
2.0 
55 
0 
NOP 
P14 
P14 
Output l5H 
of answer 


-..J 
.i:> 
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Microcode Table for the Sine(x) Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be 1/2 pi. 


P 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 3FF921FB 
54442018 
F 
0 0 _ 
2 
0 
3 
FF 
1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 3FE45F30 
60C9C883 
F 
1 1 _ 
2 o 3 
FF 
1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 
F 3FFOOOOO 00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 
F 3FOOOOOO00000000 
F01..r2 
1 3 
BF 1 
1 0 
0 
1CO 
001033100 
0 
F 3FFOOOOO 00000000 
FOO_2 
o 3 
FB 1 
1 1 o 1A3 
1 0 0 0 
3 
3 
1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
FB 1 
1 1 o 1A3 
1 0 0 0 
3 
3 
1 000 
F 00000000 
00000004 
F01_2 
o 1 
BF 1 
1 1 o 240 
o 0 0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
FB 1 
1 1 o 1A2 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
F6 
1 
1 1 o 181 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
FE 1 
1 1 o 182 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOOI2 
o 3 
FF 
1 
1 0 
0 300 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
F7 
1 
1 1 o 1AO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
5F 
1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 40000000 
00000000 
FOOS2 
o 3 
EF 1 
1 0 
0 
1CO o 0 0 0 
3 
3 
1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
EF 1 
1 1 o 1CO o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 000 
3 
3 
1 000 
F BFFOOOOO00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 
3 
1 000 
F 3D19046B 
704C8F32 
F 
0 
1 _ 
2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOOI2 
o 3 
FB 1 
1 0 
0 
180 
o 0 0 0 
3 3 
1 000 
F B0962909 
C5C01 E06 
F 
0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 
3 
1 000 


Microcode Table for the Sine(x) Calculation (Continued) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C T L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3EOD5351 
7735F927 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F BE7CC930 
FDOADA9D 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
000 
0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3EE3EOAF 
61F7677F 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
000 
0 3 3 1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
000 
0 3 3 1 000 
F BF41E5FD 
EF25C403 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3F92A9FB 
40C119ED 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F BFD23B03 
366AAOC9 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
000 
0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
000 
0 3 3 1 000 
F 3FF4464B 
CC8CBA1F 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 o 0 003 
3 1 000 
F 3FFOOOOO 
00000000 
F01_2 
1 3 
BF 
1 
1 1 o 1CO o 0 0 0 3 3 1 0 ~ 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 0 0 3 3 1 000 
-...J 
F 00000000 
00000000 
~ 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 0 0 3 300 
0 0 
....• 


Tangent Routine Using Chebyshev's 
Method 
All floating-point 
inputs and outputs are double precision. The input is in radians. 


Steps Required to Perform the Calculation 


STEP 1 - 
Preprocessing; range reduce the input, X, to a range of [-1,1]. 
Next 
squarethis range-reducedvalue, multiply it by 2.0, and finally subtract 
1.0. X3 is the range-reducedinput value, it must be stored externally. 
'TRUNC' means to truncate. If X2 > 1.0, then in the postprocessing 
part of the routine, the answer is the reciprocal of X5 * X3. 


X1 +- X*(4.0/pi) 
X2 +- X1 - 
(4(TRUNC(0.25(X1 
+ 1.0)))) 
If X2 > 1.0 
Then X3 +- 2.0 - 
X2 
Else X3 +- X2 
X4 +- 2.0*(X3*X3) 
- 
1.0 


STEP 2 - 
Core Calculation; X4 in Step 1 will be referred to as 'x' in the core 
calculation. 


+- (((((((((((((C14)*X + C13)*x + C12)*X + C11)*x + C1Q)*x + 


cg)*x 
+ C8)*x + C7)*x + C6)*x + C5)*x + C4)*X + C3)*X + 
C2)*x + c1)*x + cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times 
X3. If X2 > 1.0, then the reciprocal of X5*X3 
is the answer, if 
X2 
:5 1.0 then X5*X3 
is the answer. 


Tangent(X) +- X5*X3 
(or reciprocal of X5*X3) 


Algorithms for the Three Steps 


T1 
+-X*(4.0/pi) 
T2 
+-T1 + 1.0 
T3 
+-0.25*T2 
T4 
+-INT(T3) 
T5 
+-4*T4 
T6 
+-DOUBLE(T5) 
T7 
+-CREG +- T6 
CMP (1.0, T7) 
If (1.0 > T7) 
Then T8 +- 2.0 - 
CREG 
Else T8 +- CREG 


CREG +- T1 
round controls set to truncate 


CREG +- T7 
T8 is X3 in Step 1, must 
be stored externally 


T9 
+-CREG*CREG 
T10 +-T9*2.0 
T11+-T10 
- 
1.0 


T12 +-C14*CREG 
T13+-T12 
+ c13 
T14 +-T13*CREG 
T15+-T14 
+ c12 
T16 +-T15*CREG 
T17+-T16 
+ c11 
T18 +-T17*CREG 
T19+-T18 
+ c10 
T20 +-T19*CREG 
T21 +-T20 + c9 
T22 +-T21 *CREG 
T23 +-T22 + c8 
T24 +-T23*CREG 
T25 +-T24 + c7 
T26 +-T25*CREG 
T27 +-T26 + c6 
T28 +-T27*CREG 
T29 +-T28 + c5 
T30 +-T29*CREG 
T31 +-T30 + c4 
T32 +-T31 *CREG 
T33 +-T32 + c3 
T34 +-T33*CREG 
T35 +-T34 + c2 
T36 +-T35*CREG 
T37 +-T36 + c1 
T38 +-T37*CREG 
T39 +-T38 + cO 


T40 +-T39*T8 
If X2 (in Step 1) > 1.0 
Then Tangent(X) +- 1.0/T40 
Else Tangent(X) +- T40 


T11 is X4 in Step 1, the 
input to the core routine 


Required System Intervention 


As seen in the algorithm for Step 1, the' ACT884 7 performs a compare. The results 
of this compare determine which one of two calculations is to be performed. The 
system, in which the' ACT884 7 is a part, must make the decision as to which of the 
two calculations is to be performed. In addition, the system must store X3 and then 
later furnish X3 as an input to the' ACT884 7. Finally, the system will have to determine 
if it is necessary to take the reciprocal of the final product (T40 in the Algorithm for 
Step 3) to yield the answer. If it is necessary to take the reciprocal, then the system 
will be required to direct the variable T40 from the' ACT884 7' s output bus to the input 
buses. This is because operands for division instructions must be provided by the RA 
and RB registers; feedback is not an option. 


Calculation of Tangent(x) 
requires 79 cycles. In addition, it is assumed that five 
additional cycles are required for system intervention due to the compare instruction. 
Therefore, the total number of cycles required to perform the Tangent(x) calculation 
is 84. 


c14 = 3D747D842210CC35 
c13 = 3DA 1D66636043991 
c12 = 3DCCD078F52B3A 73 
C11 = 3DF938F9CDDFF864 
c10 = 3E2620430E99B5B7 
c9 = 3E535C2C953CE515 
c8 = 3E80F07 AFC099D7F 
C7 = 3EADA4D789EB45C4 
c6 = 3ED9F03D4C51A771 
c5 = 3F06B236DE4D014C 
c4 = 3F33DBFB01B3F415 
c3 = 3F6160DE701 F3A53 
c2 = 3F8E70A18736FC10 
C1 = 3FBAEA2653199611 
co = 3FEC14B2675B10BA 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG REG 
REG BUS 


1 
X M5H 
X l5H 
0 
RA2.RB2 
X is the input 


4DIVPI 
4DIVPI 
4DIV 
4DIVPI is a constant 
2 
M5H 
l5H 
X 
PI 
0 
RA2.RB2 
representing 4.0/pi 


3 
X 
4DIVPI 
0 
PR4+RB4 
RA2.RB2 


4 
1.0 M5H 
1.0 L5H 
X 
1.0 
0 
PR4+RB4 
P1 


5 
0.25 M5H 
0.25 L5H 
X 
0.25 
1 
5R5.RB5 
P1 
51 


6 
1.0 M5H 
1.0 l5H 
X 
0.25 
0 
DP2I(PR7) 
5R5.RB5 
Double precision 
-+ integer 


7 
1.0 
0.25 
0 
DP2I(PR7) 
P2 
Cycles 6.7 set RND1,O = 01 


8 
4 
1.0 
4 
0 
5R8.RB8 
52 


9 
1.0 
4 
1 
12DP(PR9) 
P3 
Integer 
-+ double precision 


10 
1.0 
4 
1 
CR10-5R10 
53 


COMPARE 
54 
If 5R11 > RA11 then 13a 
11 
1.0 
4 
1 
If 5R 11 :s RA11 then 13b 
RA11,5R11 


12 
1.0 
4 
0 
NOP 
54 
Wait for system response 


13a 
2.0 M5H 
2.0 L5H 
1.0 
2.0 
1 
RB13-CR13 
Execute 13a or 13b 


13b 
1.0 
4 
1 
PA5(CR13) 
Pass contents 
of Creg 


55 is either RB13-CR13 or 


2.0 
55 
55 
CR13 from PA55 CR13, and 
14 
1.0 
1 
CR14.CR14 
must be stored externally 
or 4 
for use in cycle 61 


2.0 M5H 
2.0 L5H 
1.0 
2.0 
0 
RA16.PR16 
CR14.CR14 
55 
55 
Output 55 in cycles 14 and 
15 
or 4 
15 


2.0 
RA16.PR16 
P4 
16 
2.0 
0 
or 4 


2.0 
2.0 
0 
PR18+RB18 
RA16.PR16 
17 
or 4 


18 
-1.0 
M5H 
-1.0 
l5H 
2.0 
-1.0 
0 
PR18+RB18 
P5 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
BUS 
INSTR 
REG 
REG 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 


19 
c14 MSH 
c14 lSH 
2.0 
c14 
1 
SR19.RB19 
S6 
Start core calculation 


20 
2.0 
c14 
0 
PR21 +RB21 
SR19.RB19 
S6 
S7 is input to core calc. 


21 
c13 MSH 
c13 LSH 
2.0 
c13 
0 
PR21 +RB21 
P6 


22 
2.0 
c13 
1 
SR22.CR22 
S7 


23 
2.0 
c13 
0 
PR24+RB24 
SR22.CR22 


24 
c12 MSH 
c12 LSH 
2.0 
c12 
0 
PR24+RB24 
P7 


25 
2.0 
c12 
1 
SR25.CR25 
S8 


26 
2.0 
c12 
0 
PR27 +RB27 
SR25.CR25 


27 
c11 MSH 
c11 LSH 
2.0 
c11 
0 
PR27 +RB27 
P8 


28 
2.0 
c11 
1 
SR28.CR28 
S9 


29 
2.0 
c11 
0 
PR30+RB30 
SR28.CR28 


30 
c10 MSH 
c10 LSH 
2.0 
c10 
0 
PR30+RB30 
P9 


31 
2.0 
c10 
1 
SR31.CR31 
S10 


32 
2.0 
c10 
0 
PR33+RB33 
SR31.CR31 


33 
c9 MSH 
c9 LSH 
2.0 
c9 
0 
PR33+RB33 
P10 


34 
2.0 
c9 
1 
SR34.CR34 
S11 


35 
2.0 
c9 
0 
PR36+RB36 
SR34.CR34 


36 
c8 MSH 
c8 LSH 
2.0 
c8 
0 
PR36+RB36 
P11 


37 
2.0 
c8 
1 
SR37.CR37 
S12 


38 
2.0 
c8 
0 
PR39+RB39 
SR37.CR37 


39 
c7 MSH 
c7 LSH 
2.0 
c7 
0 
PR39+RB39 
P12 


40 
2.0 
c7 
1 
SR40.CR40 
S13 


41 
2.0 
c7 
0 
PR42+RB42 
SR40.CR40 


42 
c6 MSH 
c6 LSH 
2.0 
c6 
0 
PR42 +RB42 
P13 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 


BUS 
BUS 
REG 
REG 
MODE 
INSTR 
REG 
REG 
REG 
BUS 
COMMENT 
PIPE 
PIPE 


43 
2.0 
c6 
1 
SR43.CR43 
S14 


44 
2.0 
c6 
0 
PR45+RB45 
SR43.CR43 


45 
c5 MSH 
c5 LSH 
2.0 
c5 
0 
PR45+RB45 
P14 


46 
2.0 
c5 
1 
SR46.CR46 
S15 


47 
2.0 
c5 
0 
PR48+RB48 
SR46.CR46 


48 
c4 MSH 
c4 LSH 
2.0 
c4 
0 
PR48+RB48 
P15 


49 
2.0 
c4 
1 
SR49.CR49 
S16 


50 
2.0 
c4 
0 
PR51 +RB51 
SR49.CR49 


51 
c3 MSH 
c3 LSH 
2.0 
c3 
0 
PR51 +RB51 
P16 


52 
2.0 
c3 
1 
SR52.CR52 
S17 


53 
2.0 
c3 
0 
PR54+RB54 
SR52.CR52 


54 
c2 MSH 
c2 LSH 
2.0 
c2 
0 
PR54+RB54 
P17 


55 
2.0 
c2 
1 
SR55.CR55 
S18 


56 
2.0 
c2 
0 
PR57 + RB57 
SR55.CR55 


57 
c1 MSH 
c1 LSH 
2.0 
c1 
0 
PR57+RB57 
P18 


58 
2.0 
c1 
1 
SR58.CR58 
S19 


59 
2.0 
c1 
0 
PR60+RB60 
SR58.CR58 


60 
cO MSH 
cO LSH 
2.0 
cO 
0 
PR60+RB60 
P19 


61 
S5 MSH 
S5 LSH 
2.0 
S5 
1 
SR61.RB61 
S20 
Begin postprocessing 


Instruction 
is RA + RB. used 


62 
2.0 
S5 
0 
DUMMY 
SR61.RB61 
to allow 
time for result 


to propagate 
to Y bus 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
Y 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
REG 
COMMENT 
PIPE 
PIPE 
REG 
REG 
BUS 


Output M5H, if cycle 13b 


was executed then P20 is 


63 
2.0 
55 
0 
NOP 
P20 
P20 
the answer; if cycle 13a 


was executed then the 


answer is 1.0/P20, which 


is calculated next 


64 
1.0 M5H 
1.0 LSH 
2.0 
55 
0 
DIV 
P20 
Output L5H 


Operands for Division must 
65 
P20 M5H 
P20 L5H 
1.0 
P20 
0 
DIV 
come from RA and RB, 


feedback is not an option 


66 
1.0 
P20 
0 
NOP 
Wait for Division result 


67 
1.0 
P20 
0 
NOP 
Wait for Division result 


68 
1.0 
P20 
0 
NOP 
Wait for Division result 


69 
1.0 
P20 
0 
NOP 
Wait for Division result 


70 
1.0 
P20 
0 
NOP 
Wait for Division result 


71 
1.0 
P20 
0 
NOP 
. 
Wait for Division result 


72 
1.0 
P20 
0 
NOP 
Wait for Division result 


73 
1.0 
P20 
0 
NOP 
Wait for Division result 


74 
1.0 
P20 
0 
NOP 
Wait for Division result 


75 
1.0 
P20 
0 
NOP 
Wait for Division result 


76 
1.0 
P20 
0 
NOP 
Wait for Division result 


77 
1.0 
P20 
0 
NOP 
Wait for Division result 


78 
1.0 
P20 
0 
NOP 
P21 
P21 
Output M5H of answer 


79 
1.0 
P20 
0 
NOP 
P21 
P21 
Output L5H of answer 


Microcode Table for the Tangent(x) Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be 1/3 pi. 


P 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 3FFOC152 
38207365 
F 
0 0 _ 
2 o 3 
FF 
1 
1 1 o 1CO o 0 0 0 
3 
3 
1 000 
F 3FF45F30 
60C9C883 
F 
1 1 _ 
2 o 3 
FF 
1 
1 1 o 1CO o 0 0 0 
3 
3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
000033100 
0 
F 3FFOOOOO 00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 
0 
3 
3 
1 000 
F 3FOOOOOO00000000 
F01S2 
1 3 
BF 1 
1 0 
0 
1CO 
001033100 
0 
F 3FFOOOOO 00000000 
FOO_2 
o 3 
FB 1 
1 1 o 1A3 
1 000 
3 
3 
1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
FB 1 
1 1 o 1A3 
1 000 
3 
3 
1 000 
F 00000000 
00000004 
F01_2 
o 1 
BF 1 
1 1 o 240 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
FB 1 
1 1 o 1A2 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
F6 
1 
1 1 o 181 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 
1 3 
FE 
1 
1 1 o 182 
000033100 
0 
F 00000000 
00000000 
F 
0 O.I 
2 
0 
3 
FF 
1 
1 0 
0 300 
000033100 
0 
F 40000000 
00000000 
F01_2 
1 3 
F7 
1 
1 1 o 183 
0000331 
000 
F 00000000 
00000000 
F 
00_ 
2 
1 3 
5F 
1 
1 1 o 1CO 
0000331 
000 
F 40000000 
00000000 
F 
0 0 I 
2 o 3 
EF 1 
1 0 
0 
1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
EF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F BFFOOOOO00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 
3 
1 000 
F 30747084 
2210CC35 
F 
0 
1 _ 
2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOOI2 
o 3 
FB 1 
1 0 
0 
180 
o 0 0 0 
3 3 
1 000 


-..J 
F 30A10666 
36043991 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 


.i:. 
(0 


';'I 


(110 
Microcode Table for the Tangent(x) Calculation (Continued) 


p 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 555 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F 3DCCD078 F52B3A73 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 3DF938F9 
CDDFF864 
F 
0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 3E262043 
OE99B5B7 
F01_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 3E535C2C 
953CE515 
F01_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 3E80F07A 
FC099D7F 
F01_2 
o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 0000331 
000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
o 0 0 0 3 3 
1 000 
F 3EADA4D7 89EB45C4 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 3ED9F03D 
4C51A771 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 000033100 
0 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
000033100 
0 
F 3F06B236 
DE4D014C 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 


Microcode Table for the Tangent(x) Calculation (Continued) 


p 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F 3F33DBFB 
01B3F415 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
o 0 0 0 3 3 
1 000 
F 3F6160DE 
701F3A53 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
000 
0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 000 
0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
000 
0 3 3 
1 000 
F 3F8E70A1 
8736FC10 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
0000331 
000 
F 3FBAEA26 
53199611 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
F 
00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 o 180 
000 
0 3 3 
1 000 
F 3FEC14B2 
675B10BA 
F 
0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 003 
3 
1 000 
F 3FE55555 
55555555 
F01_2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 180 
o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 
1 
1 1 o 300 
o 0 0 0 3 3 
1 000 
F 3FFOOOOO00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 1EO o 0 0 0 3 300 
0 0 
F 3FE279A7 
45903310 
F 
1 1 _ 
2 o 3 
FF 
1 
1 1 o 1EO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 003 
3 
1 000 


';'J 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FF 
1 
1 1 o 300 
o 0 003 
3 
1 000 


(]I~ 


-...J, 
(jI 
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Microcode Table for the Tangent(x) Calculation (Concluded) 


p 
D 
D 
PEE 
C 
pee 
s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
Leo 
S 
D seT 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
TeE 
STY 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 1 
1 1 o 300 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 1 
1 1 o 300 
000 
0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 
1 
1 1 o 300 
000033100 
0 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 1 
1 1 o 300 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 1 
1 1 o 300 
o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 1 
1 1 o 300 
o 0 003 
3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 
1 
1 1 o 300 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 1 
1 1 o 300 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 1 
1 1 o 300 
o 0 0 0 
3 3 0 0 0 0 


STEP 1 - 
Preprocessing; range reduction is not needed, because an input, X, 
outside the range of [- 
1,11 indicates an error. This routine requires 
that the X2 be less than or equal to 1/2. The first operation to be 
performed is to square X, then multiply it by 4.0, and finally subtract 
1.0. 


STEP 2 - 
Core Calculation; X1 in Step 1 will be referred to as 'x' in the core 
calculation. 


+- ((((((((((((((((C1S*X +C17)*x 
+ C16)*x 
+ 
c15*x 
+ C14)*X + C13)*x 
+ C12)*x 
+ C11)*x + C10)*x 
+ 
cg)*x 
+ CS)*x +C7)*x 
+ C6)*x + C5)*X + C4)*X + C3)*x + 
c 2) * x + C1) * x + cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times 
SQRT(2.01. then multiply this product by X, the original input. This 
yields ArcSine(X). To calculate ArcCosine(Xl. the following identity 
is used: 


X3 +- X2*SQRT(2.0) 
ArcSine(X) +- X3*X 
ArcCosine(X) +- pi/2 - 
ArcSine(X) 


Algorithms for the Three Steps 


Step 1 perform the preprocessing: 


T1 +-X*X 
T2 +-4.0*T1 
T3 +-T2 - 
1 
T3 is X1 in Step 1, the input to the core 
routine 


T4 
'-C18*CREG 
T5 
.-T4 
+ c17 
T6 
'-T5*CREG 
T7 
'-T6 
+ c16 
T8 
.-T7*CREG 
T9 
'-T8 
+ c15 
T10 '-T9*CREG 
T11 .-T10 
+ c14 
T12 .-T11 *CREG 
T13.-T12 
+ c13 
T14 '-T13*CREG 
T15.-T14 
+ c12 
T16.-T15*CREG 
T17'-T16 
+ c11 
T18 .- T17 *CREG 
T19.-T18 
+ clO 
T20 '-T19*CREG 
T21 '-T20 
+ c9 
T22 .- T21 *CREG 
T23 '-T22 
+ c8 
T24 .- T23 *CREG 
T25 '-T24 
+ c7 
T26 '-T25*CREG 
T27.-T26 
+ c6 
T28 '-T27*CREG 
T29 '-T28 
+ c5 
T30 '-T29*CREG 
T31 '-T30 
+ c4 
T32 .-T31 *CREG 
T33 .- T32 + c3 
T34 '-T33*CREG 
T35 '-T34 
+ c2 
T36 '-T35*CREG 
T37.-T36 
+ c1 
T38.-T37*CREG 
T39 .- T38 + cO 


T40 .- X*T39 
ArcSine(X) .- T40*SQRT(2.0) 
ArcCosine(X) .- pil2 - 
ArcSine(X) 


Number of 'ACT884 7 Cycles Required to Calculate ArcSine(x) and 
ArcCosine(x) 


The total 
number of cycles required to perform the ArcSine(x) 
and ArcCosine(x) 
calculation 
is 68. 


c18 = 3DA4A49F8CCD9E73 
c17 = 3DC05DFE52AAD200 
c16 = 3DCCF31 E26F94C8D 
c15 = ~DE86CDA3C8CAEBO 
c14 = 3E0768D9F4E950EA 
c13 = 3E2383A37598FC80 
c12 = 3E403E4B2F65FODE 
c11 = 3E5BAFC8245ABDF8 
c10 = 3E77E3333AFF1AB4 
c9 = 3E94E3A4D4220C9C 
c8 = 3EB296DD4C084ACB 
c7 = 3EDOE913F5F9D496 
c6 = 3EEFA74E896F8FA8 
c5 = 3FOEC76B7832DBB6 
c4 = 3F2F978698C8B2E4 
c3 = 3F519B1087542073 
c2 = 3F7696895FFC05AO 
c1 = 3FA375CA61 D2988C 
cO = 3FE7B20423D1D930 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
Y 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
REG 
REG 
BUS 
COMMENT 
PIPE 
PIPE 
REG 


1 
X MSH 
X LSH 
0 
RA2.RB2 
X is the input 


2 
X MSH 
X LSH 
X 
X 
0 
RA2.RB2 


3 
4.0 MSH 
4.0 LSH 
X 
X 
0 
RA4.PR4 
RA2.RB2 


4 
4.0 
X 
0 
RA4.PR4 
Pl 


5 
4.0 
X 
0 
PR6+RB6 
RA4.PR4 


6 
-1.0 
MSH 
-1.0 
LSH 
4.0 
-1.0 
0 
PR6+RB6 
P2 


7 
c18 MSH 
c18 LSH 
4.0 
c18 
1 
SR7.RB7 
Sl 
Start core calculation 


8 
4.0 
c18 
0 
PR9+RB9 
SR7.RB7 
Sl 
Sl 
is input to core calc. 


9 
c17 MSH 
c17 LSH 
4.0 
c17 
0 
PR9+RB9 
P3 


10 
4.0 
c17 
1 
SR10.CR10 
S2 


11 
4.0 
c17 
0 
PR12+RB12 
SR10.CR10 


12 
c16 MSH 
c16 LSH 
4.0 
c16 
0 
PR12+RB12 
P4 


13 
4.0 
c16 
1 
SR13.CR13 
S3 


14 
4.0 
c16 
0 
PR15+RB15 
SR13.CR13 


15 
c15 MSH 
c15 LSH 
4.0 
c15 
0 
PR15+RB15 
P5 


16 
4.0 
c15 
1 
SR16.CR16 
S4 


17 
4.0 
c15 
0 
PR18+RB18 
SR16.CR16 


18 
c14 MSH 
c14 LSH 
4.0 
c14 
0 
PR18+RB18 
P6 


19 
4.0 
c14 
1 
SR19.CR19 
S5 


20 
4.0 
c14 
0 
PR21 +RB21 
SR19.CR19 


21 
c13 MSH 
c13 LSH 
4.0 
c13 
0 
PR21 +RB21 
P7 


22 
4.0 
c13 
1 
SR22.CR22 
S6 


23 
4.0 
c13 
0 
PR24+RB24 
SR22.CR22 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


24 
c12 MSH 
c12 lSH 
4.0 
c12 
0 
PR24+RB24 
P8 


25 
4.0 
c12 
1 
SR25.CR25 
S7 


26 
4.0 
c12 
0 
PR27+RB27 
SR25.CR25 


27 
c11 MSH 
c11 LSH 
4.0 
c11 
0 
PR27+RB27 
P9 


28 
4.0 
c11 
1 
SR28.CR28 
S8 


29 
4.0 
c11 
0 
PR30+RB30 
SR28.CR28 


30 
clO MSH 
c10 LSH 
4.0 
clO 
0 
PR30+RB30 
P10 


31 
4.0 
clO 
1 
SR31.CR31 
S9 


32 
4.0 
clO 
0 
PR33+RB33 
SR31.CR31 


33 
c9 MSH 
c9 LSH 
4.0 
c9 
0 
PR33+RB33 
P11 


34 
4.0 
c9 
1 
SR34.CR34 
S10 


35 
4.0 
c9 
0 
PR36+RB36 
SR34.CR34 


36 
c8 MSH 
c8 LSH 
4.0 
c8 
0 
PR36+RB36 
P12 


37 
4.0 
c8 
1 
SR37.CR37 
Sll 


38 
4.0 
c8 
0 
PR39+RB39 
SR37.CR37 


39 
c7 MSH 
c7 LSH 
4.0 
c7 
0 
PR39+RB39 
P13 


40 
4.0 
c7 
1 
SR40.CR40 
S12 


41 
4.0 
c7 
0 
PR42+RB42 
SR40.CR40 


42 
c6 MSH 
c6 LSH 
4.0 
c6 
0 
PR42+ RB42 
P14 


43 
4.0 
c6 
1 
SR43.CR43 
S13 


44 
4.0 
c6 
0 
PR45+RB45 
SR43.CR43 


45 
c5 MSH 
c5 LSH 
4.0 
c5 
0 
PR45+RB45 
P15 


46 
4.0 
c5 
1 
SR46.CR46 
S14 


47 
4.0 
c5 
0 
PR48+RB48 
SR46.CR46 


48 
c4 MSH 
c4 lSH 
4.0 
c4 
0 
PR48+RB48 
P16 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 
COMMENT 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


49 
4.0 
c4 
1 
SR49.CR49 
S15 


50 
4.0 
c4 
0 
PR51 +RB51 
SR49.CR49 


51 
c3 MSH 
c3 LSH 
4.0 
c3 
0 
PR51 +RB51 
P17 


52 
4.0 
c3 
1 
SR52.CR52 
S16 


53 
4.0 
c3 
0 
PR54+RB54 
SR52.CR52 


54 
c2 MSH 
c2 LSH 
4.0 
c2 
0 
PR54+RB54 
P18 


55 
4.0 
c2 
1 
SR55.CR55 
S17 


56 
4.0 
c2 
0 
PR57+RB57 
SR55.CR55 


57 
c1 MSH 
c1 LSH 
4.0 
c1 
0 
PR57 + RB57 
P19 


58 
4.0 
c1 
1 
SR58.CR58 
S18 


59 
4.0 
c1 
0 
PR60+RB60 
SR58.CR58 


60 
Co MSH 
cO LSH 
4.0 
Co 
0 
PR60+RB60 
P20 


61 
X MSH 
X LSH 
4.0 
X 
1 
SR61.RB61 
S19 
Begin postprocessing 


62 
SORT(2) 
SORT(21 
4.0 
X 
0 
RA63.PR63 
SR61.RB61 
SORT(2) is the real value 


MSH 
LSH 
of square root of 2.0 


63 
SORT 


2 
X 
0 
RA63.PR63 
P21 


Instruction 
is double- 


64 
SORT 
0 
RA63.PR63 
precision 
RA + RB, prevents 


2 
X 
DUMMY 
ArcCosine 
from 
over- 


writing 
ArcSine 
result 


66 
pi/2 MSH 
pi/2 LSH 
SORT 
pi/2 
RB66-PR66 
P22 
P22 
Output 
LSH of ArcSine 
2 
1 


67 
SORT 
pi/2 
0 
NOP 
S20 
S20 
Output 
MSH of ArcCosine 
2 


68 
SORT 
pi/2 
0 
NOP 
S20 
S20 
Output 
LSH of ArcCosine 
2 


Microcode Table for the Ar~Sine(x) and ArcCosine(x) Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be 1/(SQRT(2.0)). 


p 
0 
0 
PEE 
C 
P 
C C 
S 
R H E 
F 
I 
R F S B S T S 555 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
o SeT 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 3FE6A09E 
667F3BCD 
F 
0 0 _ 
2 
0 
3 
FF 
1 
1 1 o 1CO o 000 
3 3 
1 000 
F 3FE6A09E 
667F3BCD 
F 
1 1 _ 
2 o 3 
FF 
1 
1 1 o 1CO o 000 
3 
3 
1 000 
F 40100000 
00000000 
F 
0 0 _ 
2 
0 
3 
EF 1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
F 
1 0 _ 
2 
0 
3 
EF 1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F BFFOOOOO00000000 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 30A4A49F 
8CCD9E73 
F 
0 
1 _ 
2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOOI2 
o 3 
FB 1 
1 0 
0 
180 
o 000 
3 
3 
1 000 
F 3DC05DFE 
52AAD200 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 3DCCF31E 26F94C8D 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 3DE86CDA 3C8CAEBO F 
0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 3E0768D9 
F4E950EA 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 
3 
1 000 


';'J 
(1l 
co 


-;J 


0)0 
Microcode Table for the ArcSine(x) and ArcCosine(x) Calculation (Continued) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 3E2383A3 
7598FC80 
F 0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 
003 
3 
1 000 
F 3E403E4B 
2F65FODE 
F 0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
003 
3 
1 000 
F 3E5BAFC8 
245ABDF8 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 3E77E333 
3AFF1AB4 
F 0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 
0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 3E94E3A4 
D4220C9C 
F 
0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 0 
0 
3 3 
1 000 
F 3EB296DD 
4C084ACB 
F 
0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F 3EDOE913 
F5F9D496 
F 0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 000 
3 3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FB 
1 
1 1 o 180 
0000331 
000 
F 3EEFA74E 
896F8FA8 
F 0 
1 _ 
2 
0 
3 
FB 1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F J 
0 _ 
2 
0 
3 
FB 
1 
1 1 o 180 
0000331 
000 


Microcode Table for the ArcSine(x) and ArcCosine(x) Calculation (Concluded) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C T L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 3FOEC76B 
7832DBB6 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F 3F2F9786 
98C8B2E4 
F01_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3F519B10 
87542073 
F01_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3F769689 
5FFC05AO 
F01_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 3FA375CA 
61D2988C 
F 0 1 _ 
2 o 3 
FB 
1 
1 1 o 180 o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
000033100 
0 
F 3FE7B204 
23D 1D930 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
000033100 
0 
F 3FE6A09E 
667F3BCD 
F 0 
1 _ 
2 
1 3 
BF 
1 
1 1 o 1CO 
000033100 
0 
F 3FF6A09E 
667F3BCD 
F 0 0 _ 
2 
0 3 
EF 
1 
1 1 o 1CO o 000 
3 3 1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
EF 
1 
1 1 o 1CO o 000 
3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FF 
1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
000033100 
0 
F 3FF921FB 
54442D18 
F 0 1 _ 
2 
1 3 
FB 
1 
1 1 o 183 
00003 
3 0 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
o 0 0 0 3 3 1 000 


-..J 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
00003 
3 0 000 
0> 
....• 


STEP 1 - 
Preprocessing; If the magnitude of the input, X, is greater than 1.0, 
then the reciprocal must be taken. If the magnitude of X is not greater 
than 1.0, then pass X. Let this number (either X or 1.0/X) 
be referred 
to as X1. Next multiply X1 times 2.0, then multiply this resulting 
number by X1. Finally, subtract 1.0 from this last product. 


If IXI 
> 1.0 
Then X1 ~ 
1.0/X 
Else X1 ~ X 
X2 ~ X1 *2.0*X1 
- 
1.0 


STEP 2 - 
Core Calculation; X2 in Step 1 will be referred to as 'x' in the core 
calculation. 


~ 
((((((((((((((((((C19*X +C1a)*X 
+ C17)*X + C1S)*X + C1S)*X + 
C14)*X + C13)*x 
+ C12)*x 
+ C11)*x 
+ C10)*X + cg)*x 
+ca)*x 
+ C7)*x + CS)*x + CS)*x + C4)*X + C3)*X + C2)*X 
+ c1)*x 
+ cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times X1. 
Let this number be referred to as X4. The next computation will yield 
the answer. If X was greater than 1.0, then subtract X4 from pi/2. 
If X was less than - 1.0, then subtract X4 from - pi/2. If neither of 
the two conditions above are true, then X4 is the answer. 


X4 ~ X3*X1 
If X > 1.0 
Then ArcTangent(X) ~ pi/2 - 
X4 
Else If X < -1.0 
Then ArcTangent(X) ~ 
- pi/2 - 
X4 
Else ArcTangent(X) ~ X4 


If IXI 
> 1.0 
Then T1 +- 1.0/X 
T2 +- T1 *2.0 
T3 +- T2*CREG 
T4 +- T3 - 
1.0 
Else T1 +- X 
T2 +- T1 *2.0 
T3 +- T2*T1 
T4+-T3 
- 
1.0 


T5 
+-C19*CREG 
T6 
+-T5 + c18 
T7 
+-T6*CREG 
T8 
+-T7 + c17 
T9 
+-T8*CREG 
T10+-T9 
+ c16 
T11 +-T10*CREG 
T12 +-T11 
+ c15 
T13+-T12*CREG 
T14+-T13 
+ c14 
T15 +-T14*CREG 
T16+-T15 
+ c13 
T17 +-T16*CREG 
T18+-T17 
+ c12 
T19 +-T18*CREG 
T20+-T19+C11 
T21 +-T20*CREG 
T22 +-T21 
+ c10 
T23 +-T22*CREG 
T24 +-T23 + c9 
T25 +-T24*CREG 
T26 +-T25 
+ c8 
T27 +-T26*CREG 
T28 +-T27 
+ c7 
T29 +-T28*CREG 
T30 +-T29 + c6 


T1 is X 1 in Step 1, must be stored 
externally 
CREG +- T1 


T31 -T30*CREG 
T32 -T31 
+ c5 
T33 -T32*CREG 
T34 - T33 + c4 
T35 -T34*CREG 
T36 -T35 
+ c3 
T37 -T36*CREG 
T38 - T37 + c2 
T39 -T38*CREG 
T40 -T39 
+ c1 
T41 -T40*CREG 
T42 -T41 
+ co 


T43 - 
T42*T1 
If X > 1.0 
CREG - 
T43 
Then ArcTangent(X) - 
pi/2 - 
CREG 
Return 
If X < -1.0 
Then ArcTangent(X) - 
- pi/2 - 
CREG 
Return 
ArcTangent(X) - 
CREG 


Required System Intervention 


As seen in the algorithm for Step 1, the' ACT884 7 performs a compare. The results 
of this compare determine what kind of preproccessing is to be performed. In Step 3, 
there are two more compare operations. The system must therefore perform additional 
decision making. In addition, the system must store T1, and later (inthe postprocessing) 
provide this value to the 'ACT884 7. 


Number of 'ACT8847 
Cycles Required to Calculate ArcTangent(x) 


Calculation 
of ArcTangent(x) 
requires at most 
89 
cycles 
(including the divide 
instruction). 
In addition, it is assumed that 15 additional cycles are required due to 
the compare instructions, and resulting system intervention. Therefore, the total number 
of cycles to perform the ArcTangent(x) calculation is 104. 


C19 = BDC4D6CC6308553F 
C18 = 3DDFFD56FCFD2315 
cn 
= BDE880782D99D071 
c16 = 3E0409670CB71218 
c15 = BE237C8239249B77 
c14 = 3E3F1358EC1 
D6ACO 
c13 
= BE587CD25F4AFBED 
c12 = 3E73D2388BOB8A86 
c11 = BE9028E921CA6A94 
c10 = 3EAA814997A38D4E 
c9 = BEC5EDAD9A21 
FE5F 
c8 = 3EE256E57BA07FAE 
c7 = BEFF171F48FDF707 
c6 = 3F1 ACFA9F95CAODF 
c5 = BF37 A8464221 
D994 
c4 = 3F558DF7 A83283C9 
c3 = BF749B3E2E433683 
c2 = 3F955A300BFB8078 
c1 = BFBA 1494C19FADD4 
cO = 3FEBDA7A85BD40CB 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


1 
1.0 MSH 
1.0 LSH 
0 
COMPARE 
X is the input 


RA2. IRB21 
Compare 1.0 and ABS(X) 


RA2.RB2 
If ABS(X) is greater than 


2 
X MSH 
X LSH 
1.0 
X 
0 
1.0 perform 1.0/X. 
other- 
RA2. IRB21 
wise go to cycle 16b 


3 
1.0 
X 
0 
NOP 
Wait for system response 


4 
1.0 
X 
1 
DIV 
Divide: 1.0/X 


5 
1.0 
X 
0 
NOP 
Wait for Division result 


6 
1.0 
X 
0 
NOP 
Wait for Division result 


7 
1.0 
X 
0 
NOP 
Wait for Division result 


8 
1.0 
X 
0 
NOP 
Wait for Division result 


9 
1.0 
X 
0 
NOP 
Wait for Division result 


10 
1.0 
X 
0 
NOP 
Wait for Division result 


11 
1.0 
X 
0 
NOP 
Wait for Division result 


12 
1.0 
X 
0 
NOP 
Wait for Division result 


13 
1.0 
X 
0 
NOP 
Wait for Division result 


14 
1.0 
X 
0 
NOP 
Wait for Division result 


15 
1.0 
X 
0 
NOP 
Wait for Division result 


16a 
2.0 MSH 
2.0 LSH 
1.0 
X 
0 
RA17.PR17 
P1 
P1 
If the reciprocal of X was 


17a 
2.0 
X 
0 
RA17.PR17 
P1 
performed. then execute 


18a 
2.0 
X 
0 
CR19.PR19 
RA17.PR17 
P1 
cycles 16a through 19a 


In cycles 16a and 17 a out- 


19a 
2.0 
X 
0 
CR19.PR19 
P2a 
put P1 and store it for 


use in cycle 79 


16b 
2.0 MSH 
2.0 lSH 
1.0 
X 
0 
RA17.RB17 
If the reciprocal o'f X was 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


17b 
2.0 
X 
0 
RA17.RB17 
not performed. 
then execute 


18b 
X MSH 
X LSH 
X 
X 
0 
RA19.PR19 
RA17.RB17 
cycle 16b through 
19b 


19b 
X 
X 
0 
RA19.PR19 
P2b 


2.0 
CR19.PR19 
The RA register is not 


20 
or 
X 
0 
PR21 +RB21 
or 
used again until cycle 81 


X 
RA19.PR19 
so rather than indicating 


2.0 
P3a 
the contents' 
2.0 


21 
-1.0 
MSH 
-1.0 
LSH 
or 
-1.0 
0 
PR21 +RB21 
or 
of RA as: 
or 


X 
P3b 
X' 


22 
c19 MSH 
c19 LSH 
use the term' 
2 or X' 
2 or X 
c19 
1 
SR22.RB22 
Sl 
Start the core calculation 


23 
2 or X 
c19 
0 
PR24+RB24 
SR22.RB22 
Sl 


24 
c18 MSH 
c18 LSH 
2 or X 
c18 
0 
PR24+RB24 
P4 


25 
2 or X 
c18 
1 
SR25.CR25 
S2 


26 
2 or X 
c18 
0 
PR27+RB27 
SR25.CR25 


27 
c17 MSH 
c17 LSH 
2 or X 
cn 
0 
PR27+RB27 
P5 


28 
2 or X 
cn 
1 
SR28.CR28 
S3 


29 
2 or X 
cn 
0 
PR30+RB30 
SR28.CR28 


30 
c16 MSH 
c16 LSH 
2 or X 
c16 
0 
PR30+RB30 
P6 


31 
2 or X 
c16 
1 
SR31.CR31 
S4 


32 
2 or X 
c16 
0 
PR33+RB33 
SR31.CR31 


33 
c15 MSH 
c15 LSH 
2 or X 
c15 
0 
PR33+RB33 
P7 


34 
2 or X 
c15 
1 
SR34.CR34 
S5 


35 
2 or X 
c15 
0 
PR36+RB36 
SR34.CR34 


36 
c14 MSH 
c14 LSH 
2 or X 
c14 
0 
PR36+RB36 
P8 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


37 
2 or X 
c14 
1 
SR37.CR37 
56 


38 
2 or X 
c14 
0 
PR39+RB39 
SR37.CR37 


39 
c13 MSH 
c13 LSH 
2 or X 
c13 
0 
PR39+RB39 
P9 


40 
2 or X 
c13 
1 
SR40.CR40 
57 


41 
2 or X 
c13 
0 
PR42+RB42 
SR40.CR40 


42 
c12 MSH 
c12 LSH 
2 or X 
c12 
0 
PR42+RB42 
P10 


43 
2 or X 
c12 
1 
SR43.CR43 
58 


44 
2 or X 
c12 
0 
PR45+RB45 
SR43.CR43 


45 
c11 MSH 
c11 LSH 
2 or X 
cll 
0 
PR45+RB45 
P11 


46 
2 or X 
cll 
1 
SR46.CR46 
59 


47 
2 or X 
cll 
0 
PR48+RB48 
SR46.CR46 


48 
clO 
MSH 
clO 
LSH 
2 or X 
clO 
0 
PR48+RB48 
. 
P12 


49 
2 or X 
clO 
1 
SR49.CR49 
510 


50 
2 or X 
clO 
0 
PR51 +RB51 
SR49.CR49 


51 
c9 MSH 
c9 LSH 
2 or X 
c9 
0 
PR51 +RB51 
P13 


52 
2 or X 
c9 
1 
SR52.CR52 
511 


53 
2 or X 
c9 
0 
PR54+RB54 
SR52.CR52 


54 
c8 MSH 
c8 LSH 
2 or X 
c8 
0 
PR54+RB54 
P14 


55 
2 or X 
c8 
1 
SR55.CR55 
512 


56 
2 or X 
c8 
0 
PR57+RB57 
SR55.CR55 


57 
c7 MSH 
c7 LSH 
2 or X 
c7 
0 
PR57 +RB57 
P15 


58 
2 or X 
c7 
1 
SR58.CR58 
513 


59 
2 or X 
c7 
0 
PR60+RB60 
SR58.CR58 


60 
c6 MSH 
c6 LSH 
2 or X 
c6 
0 
PR60+RB60 
P16 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
BUS 
BUS 
REG 
REG 
MODE 
INSTR 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


61 
2 or X 
C6 
1 
SR61.CR61 
S14 


62 
2 or X 
c6 
0 
PR63+RB63 
SR61.CR61 


63 
c5 MSH 
c5 lSH 
2 or X 
c5 
0 
PR63+RB63 
P17 


64 
2 or X 
c5 
1 
SR64.CR64 
S15 


65 
2 or X 
c5 
0 
PR66+RB66 
SR64.CR64 


66 
c4 MSH 
c4 lSH 
2 or X 
c4 
0 
PR66+RB66 
P18 


67 
2 or X 
c4 
1 
SR67.CR67 
S16 


68 
2 or X 
c4 
0 
PR69+RB69 
SR67.CR67 


69 
c3 MSH 
c3 lSH 
2 or X 
c3 
0 
PR69+RB69 
P19 


70 
2 or X 
c3 
1 
SR70.CR70 
S17 


71 
2 or X 
C3 
0 
PR72+RB72 
SR70.CR70 


72 
c2 MSH 
c2 lSH 
2 or X 
c2 
0 
PR72+RB72 
P20 


73 
2 or X 
C2 
1 
SR73.CR73 
S18 


74 
2 or X 
c2 
0 
PR75+RB75 
SR73.CR73 


75 
cl 
MSH 
cl 
lSH 
2 or X 
cl 
0 
PR75+RB75 
P21 


76 
2 or X 
cl 
1 
SR76.CR76 
S19 


77 
2 or X 
cl 
0 
PR78+RB78 
SR76.CR76 


78 
cO MSH 
cO lSH 
2 or X 
Co 
0 
PR78+RB78 
P22 


Tl 
is either Pl or is X 


79 
T1 MSH 
T1 LSH 
2 or X 
T1 
1 
SR79.RB79 
S20 
depending on what action 
was called for at cycle 2 


Begin the post processing 


If X > 1.0 then execute 


80 
X MSH 
X LSH 
2 or X 
T1 
0 
COMPARE 
83 through B6. otherwise 


X.l.O 
skip to 83b. In either case 


execute 80 through B2 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
COMMENT 
INSTR 
BUS 
BUS 
REG 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 
BUS 


81 
1.0 M5H 
1.0 l5H 
X 
1.0 
0 
COMPARE 
P23 
X,l.0 


82 
X 
1.0 
0 
NOP 
P23 
Wait for system response 


83 
X 
1.0 
0 
RB84-CR84 
Execute if X > 1.0 


84 
pi/2 M5H 
pi/2 L5H 
X 
pi/2 
0 
RB84-CR84 


85 
X 
pi/2 
0 
NOP 
521a 
521a 
Output M5H of answer 


86 
X 
pi/2 
0 
NOP 
521a 
521a 
Output l5H 
of answer 


The calculation is done 


Execute if X ~ 1.0. 


If - 1.0 > X then execute 


83b -1.0 
M5H 
-1.0 
L5H 
X 
1.0 
0 
COMPARE 
86b through 89b, otherwise 
-1.0,X 
skip to 86c. In either case 


execute 83b thru 85b 


84b 
X M5H 
X L5H 
-1.0 
0 
COMPARE 
P23 
X 
-1.0,X 


85b 
-1.0 
X 
0 
NOP 
P23 
Wait for system response 


86b 
-1.0 
X 
0 
RB87 -CR87 
Execute if - 1.0 > X 


87b 
-pi/2 
-pi/2 
-1.0 
-pi/2 
0 
RB87 -CR87 
M5H 
L5H 


88b 
-1.0 
-pi/2 
0 
NOP 
521b 
521b 
Output M5H of answer 


89b 
-1.0 
pi/2 
0 
NOP 
521b 
521b 
Output L5H of answer. 


The calculation is done. 


86c 
-1.0 
X 
1 
PA55(CR86) 
Execute if X is within the 


range [- 
1,1]. Pass CREG 


87c 
-1.0 
X 
0 
NOP 
521c 
521c 
Output M5H of answer 


88c 
-1.0 
X 
0 
NOP 
521c 
521c 
Output L5H of answer 


Microcode Table for the ArcTangent(x) 
Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be SQRT(.3.01. 


p 
D 
D 
PEE 
C 
P 
C C 
S 
R H E 
F 
I 
R F S B S T S 555 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
D S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 3FFOOOOO00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 18A 
o 0 0 0 
3 3 
1 000 
F 3FFBB67A 
E8584CAB 
F 
1 1 _ 
2 o 3 
FF 
1 
1 1 o 18A 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
FF 
1 
1 1 o 1EO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1'0 
0 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 
3 
1 000 
F 00000000 
00000000 
F 
0 0 _ 
2 
0 
3 
FF 
1 
1 1 o 300 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 o 3 
FF 
1 
1 1 o 300 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FF 
1 
1 1 o 300 
o 0 0 0 
3 
3 
1 000 
F 40000000 
00000000 
F 
0 0 _ 
2 
0 
3 
EF 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
1 0 _ 
2 o 3 
EF 
1 
1 1 o 1CO o 0 
0 0 
3 3 0 0 
0 
0 
F 00000000 
00000000 
FOO.I2 
o 3 
6F 
1 
1 0 
0 
1CO 
0010331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
6F 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F BFFOOOOO00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 


-...J 
F BDC4D6CC 6308553F 
F01_2 
1 3 
BF 1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
, 
-...J 
....• 


--.J 
~ 
N 
Microcode Table for the ArcTangent(x) 
Calculation (Continued) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S a a a 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
D S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO..r2 
o 3 
FB 1 
1 0 
0 180 
0000331 
000 
F 3DDFFD56 FCFD2315 
F 0 
1 _ 
2 
0 3 
FB 1 
1 1 
o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 
o 180 
o 0 003 
3 
1 000 
F BDE88078 
2D99D071 
F01_2 
o 3 
FB 1 
1 1 
o 180 
o 0 003 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO o 0 003 
3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 
o 180 
o 0 003 
3 
1 000 
F 3E040967 
OCB71218 
F 0 
1 _ 
2 
0 3 
FB 1 
1 1 
o 180 
o 0 003 
3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 
o 1CO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 
o 
180 
o 0 0 0 3 3 
1 000 
F BE237C82 
39249B77 
F 0 
1 _ 
2 
0 3 
FB 1 
1 1 
o 180 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO 000 
0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 
o 180 
000 
0 3 3 
1 000 
F 3E3F1358 
EC1D6ACO F 0 
1 _ 
2 
0 3 
FB 1 
1 1 
o 180 
o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO 000 
0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 
o 180 
000 
0 3 3 
1 000 
F BE587CD2 
5F4AFBED F01_2 
o 3 
FB 1 
1 1 
o 
180 
000 
0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO 000 
0 3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 
o 180 
000 
0 3 3 
1 000 
F 3E73D238 
8BOB8A86 
F 
0 
1 _ 
2 
o 3 
FB 1 
1 1 
o 
180 
000 
0 3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 
o 
1CO o 0 0 0 3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 
o 
180 
o 0 0 0 
3 3 
1 000 
F BE9028E9 
21CA6A94 
F 0 
1 _ 
2 
o 3 
FB 1 
1 1 
o 
180 
o 0 0 0 
3 3 
1 000 


Microcode Table for the ArcTangent(x) Calculation (Continued) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 000 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 
0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
o 0 0 
0 
3 3 
1 000 
F 3EAA8149 
97A38D4E 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F BEC5EDAD 9A21 FE5F 
F01_2 
o 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
000 
0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F 3EE256E5 
7BA07FAE 
F 0 
1 _ 
2 
0 
3 
FB 
1 
1 1 o 180 
o 0 
0 
0 
3 
3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 
0 
3 
3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
0 
0 
3 
3 
1 000 
F BEFF171F 
48FDF707 
F01_2 
o 3 
FB 1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 
0 
0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
FB 1 
1 1 o 180 
o 0 
003 
3 
1 000 
F 3F1ACFA9 
F95CAODF F 0 
1 _ 
2 o 3 
FB 
1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
000033100 
0 
F BF37A846 
4221 D994 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 3F558DF7 
A83283C9 
F 0 
1 _ 
2 o 3 
FB 
1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 


';J 
F BF749B3E 
2E433683 
F01_2 
o 3 
FB 
1 
1 1 o 180 
o 0 
0 0 
3 3 
1 000 


-oJ 
W 


~'-I~ 
Microcode Table for the ArcTangent(x) 
Calculation (Concluded) 


p 
D 
D 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S a a a 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
D S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
D G 


F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
0000331 
000 
F 3F955A30 
OBFB8078 
F01_2 
o 3 
FB 
1 
1 1 o 180 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000033100 
0 
F BFBA1494 
C19FADD4 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 000 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 o 3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 3FEBDA7A 
85BD40CB 
F 0 
1 _ 
2 o 3 
FB 
1 
1 1 o 180 
0000331 
000 
F 3FE279A7 
4590331C 
F 0 
1 _ 
2 
1 3 
BF 
1 
1 1 o 1CO 
0000331 
000 
F 3FFBB67A 
E8584CAB 
F 0 0 _ 
2 o 3 
FF 
1 
1 1 o 182 
o 0 
0 0 
3 
3 
1 000 
F 3FFOOOOO 00000000 
F 
1 
1 _ 
2 o 3 
FF 
1 
1 1 o 182 
o 0 
0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 I 
2 
0 
3 
FF 
1 
1 0 
0 300 
0010331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 
3 
F7 
1 
1 1 o 183 
000 
0 
3 3 
1 000 
F 3FF921FB 
54442D18 
F 
0 
1 _ 
2 o 3 
F7 
1 
1 1 o 183 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FF 
1 
1 1 o 300 
o 000 
3 3 
1 000 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 
1 o 300 
00003 
3 0 
000 


All floating-point inputs and outputs are double precision. 


Steps Required to Perform the Calculation 


STEP 1 - 
Preprocessing;first multiply the input, X, by 1092e(yielding X1). Next, 
convert this product to an integer, using truncate mode (yielding X2). 
Form the variable EX by adding 1024 to X2. EX is used in the 
postprocessing part of the routine. Subtract 1023 from EX to find 
the variable N (N is actually X2 incremented by 1). Convert N to a 
floating-point number (yielding X3). Subtract X1 from X3, multiply 
this difference by 2.0, and then finally subtract 
1.0. This last 
computation is the input to the core routine. 


X1 +- X*1092e 
X2 +- TRUNC(X1) 
EX +- 1024 + X2 
N +- 
EX - 
1023 
X3 +- DOUBLE(N) 
X4 +- 2.0*(X3 
- 
X1) - 
1.0 


TEP 2 - 
Core Calculation; X4 in Step 1 will be referred to as 'x' in the core 
calculation. 


X5 +- Cseries_exp 


+- ((((((((((c11 *x + C10)*X + cg)*x 
+ C8)*x + c7)*x 
+ C6)*x + 
C5)*x + c4)*x 
+ C3)*x + C2)*x + C1)*x + cO 


STEP 3 - 
Postprocessing; multiply the output of the core calculation times 2N. 
To generate 2N, perform the following: shift left logical 20 positions 
(bits) the variable EX (which was calculated in Step 1). The resulting 
bit pattern will be the double precision floating-point representation 
of 2N. However, the' ACT884 7 will not at this point recognize the 
bit pattern as a floating-point number. So this number must be output 
from the Y bus, and then input (declaring the input to be a double 
precisionfloating-point number) on the input bus. Now the' ACT884 7 
will process 2N as a double float, and so the core output, X5, can 
be multiplied by 2N to produce the final result. 'SLL' means to shift 
left logical. 


X6 +- EX SLL by 20 bits 
Y bus +- X6 
DA bus +- Y bus 
Exp(X) +- X5 * X6 


T1 
+-X*lo92e 
T2 
+-INT(T1) 
T3 
+-1024 
+ T2 


T4 
+- T3 - 
1023 
T5 
+-1*T4 
T6 
+-DOUBLE(T5) 
T7 
+-T6 - 
CREG 
T8 
+-2.0*T7 
T9 
+-T8 
- 
1.0 


T10 +-c11 *CREG 
T11 +-no 
+ c10 
T12 +-n1 
*CREG 
T13+-T12 
+ c9 
T14+-T13*CREG 
T15 +-T14 + C8 
T16 +-T15*CREG 
T17 +-T16 + C7 
T18 +-T17 *CREG 
n9 +- n8 + c6 
T20 +-n9*CREG 
T21 +-T20 + c5 
T22 +-T21 *CREG 
T23 +-T22 + C4 
T24 +-T23*CREG 
T25 +-T24 + c3 
T26 +-T25*CREG 
T27 +-T26 + C2 
T28 +-T27*CREG 
T29 +-T28 + C1 
T30 +-T29*CREG 
T31 +-T30 + cO 


lo92e entered as a constant 
round controls set to truncate 
T3 is EX in Step 1, must be 
stored externally, 
CREG +- T1 


makes T4 available to A2 MUX 
convert from integer to double 


T9 is X4 in Step 1, the 
input to the core routine 


Shift T3 20 bits left 
Output and then Input T32 
CREG ~ T31 
Two cycles required to 
input both halves of T32 


T32 ~ T3 SLL by 20 bits 
y bus ~ T32 


Exp(X) 
~ T32*CREG 


Required System Intervention 


The system is required to store the variable EX, and then later provide this variable. 
In addition, the system is required to route the variable T32 (in Step 3) from the Y 
bus to the DA bus. 


Calculation of Exp(x) requires 52 cycles. Since there are no decisions which the system 
is required to perform, the total number of cycle to perform the Exp(X) calculation is 52. 


c11 = BD45A7FC05D3B501 
clO 
= 3D957BFD2DBF487C 
c9 = BDE351B821AC16D5 
c8 = 3E2F5BOE17440879 
c7 = BE769E51EE631E87 
c6 = 3EBC8D7530548DD5 
c5 = BEFEE4FD234A4926 
c4 = 3F3BDB696E8987 AC 
c3 = BF741839EB88156E 
c2 = 3FA5BE298ADF0369 
c1 = BFCF5E46537AB906 
cO = 3FE6A09E667F3BCC 


CLK 
DA 
DB 
RA 
RB 
CLK 
MUL 
ALU 
P 
C 
S 
y 


BUS 
BUS 
REG 
REG 
MODE 
INSTR 
REG 
REG 
REG 
BUS 
COMMENT 
PIPE 
PIPE 


1 
X MSH 
X LSH 
0 
RA2.RB2 
X is the input 


2 
L092e 
L092e 
X 
L092e 
0 
RA2.RB2 
MSH 
LSH 


3 
X 
L092e 
0 
DP21(PR4) 
RA2.RB2 
Double-precision 
-+ integer 


4 
X 
L092e 
0 
DP21(PR4) 
P1 


5 
1024 
1024 
L092e 
0 
RA5+SR5 
P1 
S1 


Store S2, which 
is the 


6 
-1023 
-1023 
L092e 
0 
RA6+SR6 
S2 
S2 
variable 
EX, for use in 


cycle 46 


7 
1 
-1023 
1 
0 
SR7.RB7 
S3 


8 
-1023 
1 
1 
12DP(PR8) 
P2 
Integer 
-+ double-precision 


9 
-1023 
1 
1 
SR9-CR9 
S4 


10 
2.0 MSH 
2.0 LSH 
-1023 
2.0 
1 
SR10.RB10 
S5 


11 
-1023 
2.0 
0 
PR12 + RB12 
SR10.RB10 


12 
-1.0 
MSH 
-1.0 
LSH 
-1023 
-1.0 
0 
PR12 + RB12 
P3 


Start core calculation, 


13 
c11 MSH 
c11 LSH 
-1023 
c11 
1 
SR13.RB13 
S6 
S6 is the input to the 


core calculation 


14 
-1023 
c11 
0 
PR15+RB15 
SR13.RB13 
S6 


15 
clO 
MSH 
clO 
LSH 
-1023 
clO 
0 
PR15+ RB15 
P4 


16 
-1023 
clO 
1 
SR16.CR16 
S7 


17 
-1023 
clO 
0 
PR18+ RB18 
SR16.CR16 


18 
c9 MSH 
c9 LSH 
-1023 
c9 
0 
PR18+RB18 
P5 


19 
-1023 
c9 
1 
SR19.CR19 
S8 


20 
-1023 
c9 
0 
PR21 +RB21 
SR19.CR19 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
AlU 
P 
C 
S 
y 
BUS 
BUS 
REG 
REG 
INSTR 
BUS 
COMMENT 
MODE 
PIPE 
PIPE 
REG 
REG 
REG 


21 
c8 MSH 
c8 LSH 
-1023 
c8 
0 
PR21 +R821 
P6 


22 
-1023 
c8 
1 
SR22.CR22 
S9 


23 
-1023 
c8 
0 
PR24+R824 
SR22.CR22 


24 
c7 MSH 
c7 LSH 
-1023 
c7 
0 
PR24+R824 
P7 


25 
-1023 
c7 
1 
SR25.CR25 
S10 


26 
-1023 
c7 
0 
PR27+R827 
SR25.CR25 


27 
c6 MSH 
c6 LSH 
-1023 
c6 
0 
PR27+R827 
P8 


28 
-1023 
c6 
1 
SR28.CR28 
S11 


29 
-1023 
c6 
0 
PR30+R830 
SR28.CR28 


30 
c5 MSH 
c5 LSH 
-1023 
c5 
0 
PR30+R830 
P9 


31 
-1023 
c5 
1 
SR31.CR31 
S12 


32 
-1023 
c5 
0 
PR33+R833 
SR31.CR31 


33 
c4 MSH 
c4 LSH 
-1023 
c4 
0 
PR33+R833 
P10 


34 
-1023 
c4 
1 
SR34.CR34 
S13 


35 
-1023 
c4 
0 
PR36+R836 
SR34.CR34 


36 
c3 MSH 
c3 LSH 
-1023 
c3 
0 
PR36+R836 
P11 


37 
-1023 
c3 
1 
SR37.CR37 
S14 


38 
-1023 
c3 
0 
PR39+R839 
SR37.CR37 


39 
c2 MSH 
c2 LSH 
-1023 
c2 
0 
PR39+R839 
P12 


40 
-1023 
c2 
1 
SR40.CR40 
S15 


41 
-1023 
c2 
0 
PR42+R842 
SR40.CR40 


42 
c1 MSH 
c1 LSH 
-1023 
c1 
0 
PR42+R842 
P13 


43 
-1023 
c1 
1 
SR43.CR43 
S16 


ClK 
DA 
DB 
RA 
RB 
ClK 
MUl 
ALU 
P 
C 
S 
y 


BUS 
BUS 
REG 
INSTR 
REG 
BUS 
COMMENT 
REG 
MODE 
PIPE 
PIPE 
REG 
REG 


44 
-1023 
c1 
0 
PR45+RB45 
5R43.CR43 


45 
Co M5H 
cO l5H 
-1023 
cO 
0 
PR45+RB45 
P14 


Begin post processing. 


5LL 
52 is the variable EX, and 


46 
52 
20 
52 
20 
0 
was calculated in cycle 5. 


RA46,RB46 
5hift left logical 52 


20 bit positions 


Allows time for 518 to be 
47 
52 
20 
0 
NOP 
517 
518 
518 
output from the Y bus and 


input to the DA bus 


48 
518 
52 
20 
0 
RA48.CR48 


RA holds 518', 
which is 


the double precision 
49 
0 
518' 
20 
0 
RA48.CR48 
floating-point 
equivalent 


of 2N, where N was 


calculated in cycle 6 


Instruction is RA + RB, used 
50 
518' 
20 
0 
DUMMY 
RA48.CR48 
to allow time for result 


to propagate to Y bus 


51 
518' 
20 
0 
NOP 
P15 
P15 
Output M5H of answer 


52 
518' 
20 
0 
NOP 
P15 
P14 
Output L5H of answer 


Microcode Table for the Exp(x) Calculation 


All numbers are in hex. Any field with a length that is not a multiple of 4 is right justified and zero filled. For the microcode 
table, the value of X has been chosen to be 6.25. 


p 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 555 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 0 
S 
o S C T 
L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 40190000 
00000000 
F 
00_ 
2 o 3 
FF 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 3FF71547 
652B82FE 
F11_2 
o 3 
FF 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 o 1A3 
1 000 
3 3 
1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 1 
1 1 o 1A3 
1 000 
3 3 
1 000 
F 00000400 
00000000 
F10.r201 
FE 1 
1 0 
0 200 
0010331 
000 
F FFFFFC01 
00000000 
F 
10_ 
201 
FE 1 
1 1 o 200 
0000331 
000 
F 00000000 
00000001 
F01_201 
BF 1 
1 1 o 240 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
FB 1 
1 1 o 1A2 o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 
1 3 
F6 
1 
1 1 o 183 
000 
0 
3 3 
1 000 
F 40000000 
00000000 
F01_2 
1 3 
BF 1 
1 1 o 1CO 000 
0 
3 3 
1 000 
F 00000000 
00000000 
F 
00_ 
2 o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F BFFOOOOO00000000 
F01_2 
o 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
F B045A7FC 
0503B501 
F01_2 
1 3 
BF 1 
1 1 o 1CO 000 
0 
3 3 
1 000 
F 00000000 
00000000 
F 
0 OS 
2 o 3 
FB 1 
1 0 
0 
180 
000 
0 
3 3 
1 000 
F 30957BFO 
20BF487C 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
o 000 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO o 0 0 0 
3 3 
1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
o 0 0 0 
3 3 
1 000 
F BOE351B8 
21AC1605 
F 0 
1 _ 
2 o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 ·1 1 o 1CO 000033100 
0 
F 00000000 
00000000 
FOO_2 
o 3 
FB 1 
1 1 o 180 
000033100 
0 
F 3E2F5BOE 
17440879 
F 
0 
1 _ 
2 
0 3 
FB 1 
1 1 o 180 
000 
0 
3 3 
1 000 
-..J 
Co..• 


-.,J 
00 
N 
Microcode Table for the Exp(x) Calculation (Continued) 


p 
0 
0 
PEE 
C 
P 
C C s 
R H E 
F 
I 
R F S B S T S 555 
A 
A 
B 
B N N L 
I 
L 0 
E 
E A N 
L 
N 
N A RYE 
E E E E E 
A B K 
P 
K N 
L 
S 
L C 
0 
S 
o S C T L SLY 
S C 
C 
EMF 
0 
E T 
W 
T 
T C EST 
Y 
SOl 
P 
T 
C 
R 
P T 
o G 


F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
o 000 
3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F BE769E51 
EE631E87 
F01_2 
o 3 
FB 
1 
1 1 
o 180 
o 000 
3 3 1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 
o 1CO 
o 000 
3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 
o 180 
o 000 
3 3 1 000 
F 3EBC8075 
30548005 
F 0 
1 _ 
2 
0 3 
FB 
1 
1 1 
o 180 
o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO 
o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 
o 180 
o 000 
3 3 1 000 
F BEFEE4FO 
234A4926 
F 0 1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 00_ 
2 
1 3 
9F 
1 
1 1 o 1CO 
000 
0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F 3F3BOB69 
6E8987AC 
F 0 
1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
o 0 0 0 3 3 1 000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F BF741839 
EB88156E 
F01_2 
o 3 
FB 
1 
1 1 
o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 
o 1CO 
0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 
o 180 
0000331 
000 
F 3FA5BE29 
8AOF0369 
F 0 1 _ 
2 
0 3 
Fa 
1 
1 1 
o 180 
o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
o 000 
3 3 1 000 
F 00000000 
00000000 
FOO_2 
o 3 
FB 
1 
1 1 o 180 
o 000 
3 3 1 000 
F BFCF5E46 
537AB906 
F 0 
1 _ 
2 
o 3 
FB 
1 
1 1 o 180 
0000331 
000 
F 00000000 
00000000 
FOO_2 
1 3 
9F 
1 
1 1 o 1CO 
0000331 
000 
F 00000000 
00000000 
F 0 0 _ 
2 
0 3 
FB 
1 
1 1 o 180 
o 0 0 0 3 3 1 000 
F 3FE6A09E 
667F3BCC 
F 0 
1 _ 
2 
0 3 
FB 
1 
1 1 o 180 
0000331 
000 


Microcode Table for the Exp(x) Calculation (Concluded) 


p 
D 
D 
P 
E 
E C 
P 
C C s 
R H E 
F 
I 
R F S 
B S T S 000 
A 
A 
B 
B 
N N L 
I 
L 0 
E 
E 
A N 
L 
N 
N A R Y 
E 
E 
E 
E 
E 
E 
A 
B K 
P 
K N 
L 
S 
L C 
0 
S 
D S C 
T 
L S L Y S C 
C 
E 
M 
F 
0 
E 
T 
W 
T 
T C 
E S T Y 
S 
01 
P 
T 
C 
R 
P 
T 
D G 


F 00000409 
00000014 
F 
1 1_ 
2 o 1 FF 
1 
1 1 o 228 
0 0 0 0 3 3 1 o 0 0 
F 00000000 
00000000 
F 0 1I 
2 o 3 
FF 
1 
1 0 o 300 
0 0 0 0 3 3 1 o 0 0 
F 40900000 
00000000 
F 0 0_ 
2 o 2 
DF 
1 
1 1 o 1CO 
0 0 0 0 3 3 1 o 0 0 
F 00000000 
00000000 
. 


3 3 
o 0 0 
F 
1 0_ 
2 o 2 
DF 1 
1 1 o 1CO 
0 0 0 0 
1 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 180 
0 0 0 0 3 3 1 o 0 0 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
0 0 0 0 3 3 1 o 0 0 
F 00000000 
00000000 
F 00_ 
2 o 3 
FF 
1 
1 1 o 300 
0 0 0 0 3 3 0 o 0 0 


High-Speed Vector Math and 3-D Graphics 


Introduction 


The Texas Instruments SN74ACT8847 Floating-Point Processoris designedto execute 
high-speed, high-accuracy mathematical computations. The device is especially suited 
for matrix manipulations such as those used in graphics or digital signal processing. 
This 
FPU multiplies 
and 
adds 
data 
elements 
by 
executing 
sequences 
of 
microprogrammed calculations to form new matrices. The device may be configured 
for either single- or double-precision operation. Single-precision operation is assumed 
throughout this report. 


The' ACT884 7 can perform integer and logical operations and has built-in, hardwired 
algorithms for division and square root operations. 


This application report outlines the timing, data flow, and programming for several 
common data vector calculations and matrix transformations. 
Further, it illustrates 
some of the programming "tricks" 
resulting in fastest operation. Throughout, this 
document compares the timing schemes for programs in which all registers, including 
the ALU and multiplier internal pipeline registers, are enabled ("pipelined" 
mode) with 
those for equivalent programs in which the internal pipeline registers are disabled 
("unpiped" 
model. Equations are provided to help the programmer select the more 
efficient mode, and performance figures are included with times given for 3D-MHz 
operations. 


This report begins by covering simple vector 
arithmetic 
operations, 
which 
are 
categorized as "computational" 
or "compare" 
functions 
for convenience. 
This 
document then compares these operations as they are used in graphics applications 
to perform three-dimensional coordinate transformations, 
perspective viewing, and 
clipping. 


SN74ACT8847 
Floating-Point Units 


The' ACT884 7 floating-point unit (FPUl combines a multiplier and an arithmetic-logic 
unit (ALUl in a single microprogrammable VLSI device. This device is implemented 
in TI's advanced D.8-",m 
EPIC· CMOS technology and is fully compatible with the IEEE 
standard for binary floating-point arithmetic, STD 754-1985, for either single- or double- 
precision operation. 


Instruction 
inputs can select independent ALU operation, independent multiplier 
operation, or simultaneous ALU/multiplier operation. Each FPUcan handle three types 
of data input formats. The ALU accepts data operands in integer format or IEEEfloating- 
point format. 


Data enters the' ACT884 7 through two 32-bit data buses, DA and DB (see Figure 3), 
which can be configured to operate as a single 64-bit data bus for double-precision 
operations. Data can be latched in a 64-bit temporary register or loaded directly into 
the input registers, RA and RB, which pass data to the multiplier and ALU. 


A clock-mode control allows the temporary register to be clocked on the rising or falling 
edge of the clock to support double-precisionALU operations at the same rate as single- 
precision operations. Usingthe temporary register, double-precisionnumbers on a single 
32-bit input bus can be loaded in one clock cycle. 


The input registers RA and RB are the first of three levels of internal data registers. 
Additionally, the ALU and multiplier each have an internal pipeline register and an output 
register. The ALU's output register is denoted by "5" (sum), and the multiplier's output 
register is denoted by "P" 
(product). Any or all of these internal registers may be 
bypassed. 


A 64-bit constant register (C) with a separate clock is provided for temporary storage 
of a multiplier result, ALU result, or constant for feedback to the multiplier and ALU. 
An instruction register and a status register are also included. 


Four multiplexers select the multiplier and ALU operands from the input, C, 5, or P 
registers. Results are output on the 32-bit Y bus; a Y output multiplexer selects the 
most or least significant half of the result for output. 


The' ACT884 7 FPU is fully compatible with IEEEStandard 754-1985 
for addition, 
subtraction, multiplication, division, square root, and comparison. The 'ACT8847 FPU 
also performs integer arithmetic, logical operations, and logical shifts. Additionally, 
absolute value conversions and floating point-to-integer and integer-to-floating-point 
conversions are available. 


For the 'ACT884 7, the ALU and multiplier can operate in parallel to perform sums 
of products and products of sums. Detailed information regarding the instruction inputs 
for the various' ACT884 7 configurations and operations is given in this section. 
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Mathematical 
Processing Applications 


TI's SN74ACT8847 
high-speed 
floating-point 
unit (FPU) is designed 
to perform 
high- 
accuracy, 
computationally-intensive 
mathematical 
operations. 
In particular, 
this FPU 
can meet the computational 
demands of high-end graphics workstations 
and advanced 
signal processing. 
Both applications 
involve 
repetitive 
computations 
on arrays of data 
typically 
expressed 
as vector 
arithmetic 
operations. 


For example, 
the calculation 
of the sum of products, 
or multiply-accumulate 
function, 


is frequently 
used in both signal and graphics 
processing. 
In general 
form, 
the sum 
of products 
equation 
is: 


n 
S = 
1: 
kiXi, for coefficients 
ki and data xi. 


i= 1 


This sum of products 
is the central 
function 
involved 
in multiplying 
matrices. 
Such 
matrices 
might 
represent 
a system 
of linear differential 
equations 
or the geometrical 
transformation 
of a graphic object. 
Specifically, 
an n n matrix 
A multiplied 
by an n x m 
matrix 
B yields 
an n x m matrix 
C whose 
elements 
Cij are given 
by: 


n 
Cij = 
1: 
aikXbkj 
for i = 
1, ... ,n and j = 
1,oo.,m. 
k=1 


The' ACT884 
7 is designed to handle efficiently 
this kind of parallel multiplication 
and 
addition. 


The basic 
principle 
of graphics 
processing 
is that 
any object 
can be reduced 
to a 
combination 
of points, 
lines, and polygons 
and then defined 
as a collection 
of points 


in three-dimensional 
space. Because points, 
planes, transformation 
matrices 
and other 
common 
data structures 
are vectors, 
most of the computations 
involved 
in graphics 
processing 
are vector 
operations. 


Computations 
for a 3-D graphics 
display 
are highly 
involved 
due to the complexity 


introduced 
by the z-axis. 
Viewing 
an object 
from 
a particular 
perspective 
involves 
transforming 
the object's 
world 
coordinates, 
or its coordinates 
in the model 
space, 


into viewing, 
or eyepoint, 
coordinates. 
A series of translations 
and rotations 
map the 
viewing 
system 
axes onto the world 
coordinate 
axes. Each individual 
point 
must 
be 
translated, 
rotated 
and, if necessary, 
scaled in a proper order. 
Once the coordinate 
transformation 
is complete, 
the coordinates 
are clipped to a viewing 
volume. 
Clipping 
algorithms 
employ 
arithmetic 
operations 
to determine 
whether 
an object, 
or part of 
an object, 
is inside or outside 
a pyramidal 
volume. 
Hidden surface 
routines 
may then 
be employed 
to delete surfaces 
that fall behind a "nearer" 
surface 
from the viewer's 
perspective. 


Matrix arithmetic is required for scaling, rotating, translating, or shearing an object, 
as well as for the final process of projecting its visible parts to a two-dimensional frame 
buffer. Any sequence of these transformations can be represented as a single matrix 
formed by concatenating the matrices for the individual operations. The generalized 
4 x 4 matrix for transforming a three-dimensional object is shown below, partitioned 
into four component matrices, each of which produces a specific effect on the image. 
The 3 x 3 matrix produces linear transformation in the form of scaling, shearing, and 
rotation. The 1x 3 row matrix produces translation, while the 3 x 1 column matrix 
produces perspective transformation with multiple vanishing points. The final single- 
element 1 x 1 matrix produces overall scaling. 
l 


3x3 
~] 
T = 
1 
1x3 
1x1 


Overall operation of the matrix T on the position vectors of a graphics object produces 
a combination of shearing, rotation, reflection, translation, perspective, and overall 
scaling. 


Programs that require repetitive computations on multiple sets of operands lend 
themselves to vector-processing algorithms, in which the operands are viewed as 
succeeding elements of long "data vectors." 
The next two sections outline the 
programming for commonly-used vector operations. Most of these examples conclude 
with a comparison of program timing for pipelined (internal pipeline registers enabled) 
and unpiped (internal pipeline registers disabled) operation. For convenience, the 
operations are labeled" computational," which includes simple and compounded adds, 
multiplies, and divides, or "compare," 
which can be used to select maximum or 
minimum values from succeeding pairs of numbers or from a list. 


Computational Operations on Data Vectors 


This section covers the following vector operations: vector add, vector multiply, vector 
divide, sum of products (alsocalled inner, scalar, or dot product), and product of sums. 
Since matrix multiplication is composed of a sequence of sum of products operations, 
these two functions are discussed in the same section. In some cases, a whole class 
of operations is covered under one heading. For example, the vector add operation 
includes sums and differences of Ai, Bi, IAi I, and IBjI in all combinations. 


The vector add operation adds corresponding components of data vectors to obtain 
the components of the output vector. Hence, for input vectors A and B and output 
vector Y, each with N components, 


The' ACT884 7 performs this calculation in unchained, independent ALU mode. 


Table 13 shows the contents of the data registers at successive clock cycles for N = 6 
with the FPUoperating in pipelined mode. Since the data travels by way of the internal 
pipeline register, two cycles pass before the first sum appears in the S register. The 
contents of the internal pipeline register are not given in the flow. 


RA 
Al 
A2 
A3 
A4 
AS 
A6 


RB 
Bl 
B2 
B3 
B4 
BS 
B6 


S 
Al +Bl 
A2+B2 
A3+B3 
A4+B4 
AS+BS 
A6+B6 


P 


C 


Y 
Yl 
Y2 
Y3 
Y4 
YS 
Y6 


ClK 
1 
2 
3 
4 
S 
6 
7 
8 
9 


Data transfers and operations for each clock cycle are summarizedin the program listing 
in Table 14. Detailed information on the instruction inputs required to perform each 
operation is included in sections 5 and 7. Note that the selection of the output source 
(in this case,the S register). which is determined by the 16 instruction bit, is programmed 
along with the ALU or multiplier operation that generates the output. 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 


1. 
lOAD 
RA, RB; 
y-s 
ADD(RA,RB) 


2. 
lOAD 
RA, RB; 
Y-S 
ADD(RA,RB) 


3. 
lOAD 
RA, RB; 
Y-S 
ADD(RA,RB) 


6. 
lOAD 
RA, RB; 
Y-S 
ADD(RA,RBI 


Timing and programming are similar for other independent ALU operations involving 
two operands, such as (A - B). (B- A), and compare (A,B). However, when the compare 
function is used, two status bits must be generated before numeric values can be output 
(see "Compare Operations on Data Vectors"). 


Becausethe vector add program closely parallelsthat for vector multiplication, pipelined 
and unpiped modes for both vector add and multiply are compared in the next section. 


Vector Multiply 


The vector multiply operation multiplies corresponding elements of data vectors to 
obtain the components of the output vector. Hence, for input vectors A and Band 
output vector Y, each with N components, 


Pipelined Mode 


Table 15 shows the contents of the data registers at successive clock cycles for N = 6 
with the FPUoperating in pipelined mode. The product may be replaced by a variety 
of other independent multiplier operations, such as - (A x B), A x IBI, - (A x IBI), 
IA I x IBI, and - (IA I x IBI). Data transfers and operations for each clock cycle are 
summarized in the program listing in Table 16. 


RA 
A1 
A2 
A3 
A4 
A5 
A6 


RB 
B1 
B2 
B3 
B4 
B5 
B6 
S 
p 
A 1 x B1 
A2xB2 
A3xB3 
A4xB4 
A5xB5 
A6x 
B6 


C 


Y 
Y1 
Y2 
Y3 
Y4 
Y5 
Y6 


CLK 
1 
2 
3 
4 
5 
6 
7 
8 
9 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 
1. 
LOAD RA, RB; 
y-p 
MUL T(RA,RBl 


2. 
LOAD RA, RB; 
y-p 
MUL T(RA,RB) 


3. 
LOAD RA, RB; 
y-p 
MUL T(RA,RB) 


6. 
LOAD RA, RB; 
y-p 
MUL T(RA,RB) 


Unpiped Mode 


Table 17 shows the contents of the data registers at successive clock cycles during 
a vector multiply operation for N = 6 with the FPU operating in unpiped mode. The 
vector add operation progresses similarly. Since there is no "single-clocked storage" 
in the internal pipeline register, each product or sum is performed in one cycle. 


RA 
A1 
A2 
A3 
A4 
A5 
A6 
RB 
B1 
B2 
B3 
B4 
B5 
B6 
S 


P 
A 1x B1 
A2xB2 
A3x B3 
A4xB4 
A5xB5 
A6xB6 
C 
Y 
Y1 
Y2 
Y3 
Y4 
Y5 
Y6 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 


Comparison of Pipelined and Unpiped Modes 


For both vector add and vector multiply operations carried out in pipelined mode, results 
are output to the Y bus on clocks 3, ... ,N + 2. In unpiped mode, results are output 
to the Y bus on clocks 2,...,N + 1, thereby saving a cycle. Unfortunately, it is necessary 
to operate at a lower clock rate in unpiped mode than in pipelined mode. The following 
equation can be used to determine which of the two 
modes provides the faster 
performance in a particular application. Pipelined operation is faster if: 


(N + 21/Fp < (N + 11/Fu, 


where Fp and Fu are the clock rates in pipelined and unpiped modes, respectively. 
As of publication, pipelined mode provides faster performance for input vectors with 
N >2. 


The sum of products operation multiplies corresponding elements of data vectors and 
adds the resulting products. The operation is also referred to as the inner product, 
scalar product, or dot product of two vectors, since these are the names for the function 
as it is used in vector algebra. For input vectors A and B, each with N components, 
the sum of products operation yields a single output Y defined as follows: 
N 
Y = 
1: AxB'1 
i= 1 
I 
I 


The' ACT884 7 performs this calculation in chained mode so that concurrent operation 
of the ALU and multiplier is possible. 


Pipelined Mode 


Table 18 shows the contents of the data registers at successiveclock cycles for N 
8 
with the FPU operating in pipelined mode. 


RA 
A1 
A2 
A3 
A4 
A5 
A6 
A7 
A8 
RB 
B1 
B2 
B3 
B4 
B5 
B6 
B7 
B8 
5 
51 
53 
54 
55 
56 
57 
58 
57+8 
P 
P1 
P2 
P3 
P4 
P5 
P6 
P7 
P8 
C 
P2 
P2 


Y 
Y 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


Here, Pi=AiXBi, 
51 = P1 + 0,53 
= P3 + 51,54 
= P4 + P2, 56 = P6 + 54, 
P57 = P7 + 55, and 58 = P8 + 56· The values of the sums could be more succinctly 
expressed as 5i = Pi + 5i - 2 (with 50 = 5 - 1 = 0), except that 52 = P2 + 0 = 
P2 does not actually appear in the data flow as a sum in the 5 register. Instead, the 
C register holds P2 for two cycles. 


This approach, although introducing a certain lack of symmetry into the programming, 
frees up the 5 register at a point allowing the efficient overlap of succeeding sum of 
products operations without any dead cycles. A new sum of products operation can 
begin at ClK 9, and the 5 register remains free to hold the first operation's result in 
ClK 
14. 5imilary, by storing 57 in the C register in ClK 
12, rather than multiplying 
it by one, the P register remains free to hold "P2" for the next pair of data vectors. 
By ClK 
12, 57 = P1 + P3 + P5 + P7 and 58 = P2 + P4 + P6 + P8, so that 
Y = 57 + 58· 


Datatransfers and operationsfor eachclock cycle aresummarizedin the program listing 
in Table 19. 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 


1. 
LOAD RA, RB 
MUL T(RA,RB) 


2. 
LOAD RA, RB 
MUL T(RA,RB) 


3. 
LOAD RA, RB 
ADD(P,O) 
MUL T(RA,RB) 


4. 
LOAD RA, RB; 
C-P 
MUL T(RA,RB) 


5. 
LOAD RA, RB 
ADD(P,5) 
MUL T(RA,RB) 


6. 
LOAD RA, RB 
ADD(P,C) 
MUL T(RA,RB) 


7. 
LOAD RA, RB 
ADD(P,5) 
MUL T(RA,RB) 


8. 
LOAD RA, RB 
ADD(P,5) 
MUL T(RA,RB) 


9. 
ADD(P,5) 


10. 
ADD(P,5) 
11. 
C-5 


12. 
V-5 
ADD(5,C) 


The above algorithm imposes no delay between input vectors. The time required to 
carry out the sum of products operation on M pairs of input vectors in succession, 
each of length N, is Nx M + 6 cycles. 


Unpiped Mode 


In the unpiped version of the sum of products, the data flow is more straightforward. 
Again, chained mode is employed to allow the AlU 
and multiplier to operate 
concurrently. Table 20 shows the contents of the data registers at successive clock 
cycles for N = 8 with the FPUoperating in unpiped mode. Here, Pi = Ai x Bi, and 
Si = So- 1) + Pi, with So = O. 


RA 
A1 
A2 
A3 
A4 
A5 
A6 
A7 
A8 


RB 
B1 
B2 
B3 
B4 
B5 
B6 
B7 
B8 


5 
51 
52 
53 
54 
55 
56 
57 
58 


P 
P1 
P2 
P3 
P4 
P5 
P6 
P7 
P8 


C 


V 
V 


CLK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


A new problem can be presented at elK 9 without any delay between the vectors. 
Therefore, the time required to compute the sums of products for M pairs of vectors, 
each of length N, is Nx M + 2 clock cycles. 


Comparison of Pipelined and Unpiped Modes 


The following equation can be used to determine which of the two modes provides 
the faster performance in a particular application. Pipelined operation is faster if: 


(MXN+61/Fp 
< (MxN+21/Fu, 


where Fp and Fu are the clock rates in pipelined and unpiped modes, respectively. 
Becausethe unpiped mode's longer clock cycle usually outweighs its savings in cycles, 
pipelined mode provides faster performance for input vectors with N >4. 


The product of sums operation adds corresponding elements of data vectors and 
multiplies the resulting sums. For input vectors A and B, each with N components, 
the product of sums operation yields a single output Y defined as follows: 


N 
Y = 
7r (Ai + Bil 
i= 1 


The product of differences can be computed by simply makingthe ALU operation (A - Bl 
or (B- AI. The' ACT884 7 performs this calculation in chained mode so that concurrent 
operation of the ALU and multiplier is possible. The data flow and program listing for 
the product of sums are identical to those for the sum of products, except that the 
roles of add and multiply are reversed. The criteria used to decide between pipelined 
and unpiped modes are also identical to those previously given. 


The vector divide operation divides corresponding elements of data vectors to obtain 
the components of the output vector. Hence, for vectors A and B and output vector 
Y, each with N components, 


The' ACT8447 performs this calculation using the Newton-Raphson iterative method. 
In the 'ACT8847, 
the divide algorithm is built in. 


A General Principle 


The vector divide example illustrates a general programming principle that should be 
considered whenever a program begins a new instruction every other cycle. In cases 
where the C register is not used, it is simple to interleave another program, even one 
not performing the same function. 


Interleaving programs is not as easy if the C register is used because the C register 
is the only nonpiped register. However, even using the C register, programs may often 
be interleaved by staggering one against the other so that their use of the C register 


does not overlap in time. Many of the programs so far discussed can be thought of 
as two such interleaved programs, with the C register being used to delay the first 
result until it can be combined with the second. (See,for example, the sum of products 
operation. ) 


SN74ACT8847 
Vector Divide 


Since the' ACT884 7 has a built-in algorithm for divide, Table 21 shows the data flow 
for pipelined operation. Data transfers and operations are summarized in the program 
listing in Table 22. 


Table 21. Data Flow for 'ACT884 7 Pipelined Single-Precision 
Vector Divide 


RA 
A1 
A2 


RB 
B1 
B2 
S 


P 
A1/B1 


C 
y 
Y1 


ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


Table 22. Program Listing for' ACT8847 Pipelined Single-Precision 
Vector Divide 


REGISTER TRANSFERS 
ALU OPERATION 
MULTIPLIER 


OPERATION 


1. 
lOAD 
RA, RB; 
y-p 
DIVIDE 


7. 
lOAD 
RA, RB; 
y-p 
DIVIDE 


13. 
lOAD 
RA, RB; 
y-p 
DIVIDE 


Note that the microinstructions 
are presented on the steps indicated (1, 7, 13, ...), 


with a six-cycle lapse before the next operands can be input to RA and RB. Performing 
a vector divide of two N-element single-precision vectors takes (6N + 2) cycles in 
pipelined mode. M such pairsof vectors would require [6(N x M) + 21 cycles in pipelined 
mode. In unpiped mode, the equation is 7(N x M). 


Compare Operations on Data Vectors 


In independent AlU mode (unchained), two operands may be compared for equality 
(A = B) and order (A > B). Additionally, the absolute values of either or both operands 
may be compared. The compare function uses two status bits, the AGTB and AEQB 
output signals. (When any operation other than a compare is performed, either by the 
AlU or the multiplier, the AEQBsignal is usedas a zero detect. Hence, numerical results 
cannot be output in the same cycle in which comparison status is output.) 


For greatest efficiency, programs for compare operations should be written without 
requiring conditional branches in the sequencer. If branches can be avoided, the 
microcoding is simplified and the programs are immediately scalable to SIMD systems 
employing many' ACT884 7 chips. 


The vector maximin operations compare corresponding elements of data vectors and 
select the maximum or minimum value to obtain the components of the output vector. 
Hence, for input vectors A and B and output vector Y, each with N components, 


Pipelined Mode 


Table 23 shows the suggested data flow for a pipelined vector MAX operation, where 
Yi is set to the max of (Ai, Bi) for all i. Included are rows to indicate the setting of 
the chain mode instruction bit (110for the' ACT884 7) and the status bit being sensed. 


CHAIN 
N 
Y 
Y 
Y 
N 
Y 
Y 
Y 
N 
Y 
RA 
A 
A1 
81 
A2 
A2 
82 
A3 
A3 
R8 
81 
82 
83 
S 
A1 
81 
A2 
82 
p 
A1 
A2 
C 
Y 
Y1 
Y2 
STATUS 
A>8 
A>8 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


A comparison starts at ClK 
= 1, 5, etc., when the chain-mode instruction bit is low. 


The result appears at ClK 
= 3, 7, etc., indicated by the AGTB and AEQB signals. 
AGTB is saved off-chip for use as instruction bit 16(output source) at ClK 4, 8, etc. 
This value for 16selects the output source, either the multiplier or the AlU result, at 
ClK 6, 10, etc. For example, if a comparison result is A > B, the AGTB signal goes 


high and is used to set 16 high. 16 then selects the multiplier result (Ai) to output. 
Similarly, if A ::S B,AGTB and 16 arelow, andthe AlU result (Bi)is output. The circuitous 
route taken by Ai on the way to the P register is necessary because it is not possible 
to pass RA or RBthrough the multiplier in parallel with passing the other through the 
AlU. 


The program is not particularly well-packed and produces the vector max of a pair 
of vectors of length N in (4N + 2) cycles. For M pairs of vectors of length N, the total 
time is (4MN + 2) cycles. The program can be improved by applying the interleaving 
principle previously discussed. The steps are rearrangedso that a new operation begins 
every other cycle, thus allowing two compare programs to be interleaved. Table 24 
shows the suggested data flow for a pipelined vector minimax operation, where 
Yi = MAX/MIN(Ai, 
Bi) and Zi = MAX/MIN(Ci, Oil. 


CHAIN 
N 
N 
Y 
Y 
Y 
Y 
N 
N 
Y 
Y 
Y 
Y 
N 
N 
RA 
A1 
C1 
A1 
C1 
B1 
01 
A2 
C2 
A2 
C2 
B2 
02 
RB 
B1 
01 
B2 
02 
S 
A1 
C1 
B1 
01 
A2 
C2 
B2 
02 
p 
A1 
C1 
A2 
C2 
C 
Y 
Y1 
Z1 
Y2 
Z2 
STATUS 
A>B 
A>B 
A>B 
A>B 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


Again, Ai (and Ci) reaches the P register by an indirect route. However, this tighter 
program performs M vector comparisons, two 
vector comparisons at a time, in 
[6 x Nx CEILlNG(M/2) + 2] cycles. (As previously defined, the ceiling function rounds 
to the next highest integer for fractional values.) In this example, two separate vector 
comparisons on two-dimensional vectors are performed, giving 6 x 2 x 1 + 2 = 14 
cycles. For M = 2 pairs of vectors, all of length N,the 
second program is as good 
as the first. For M > 2, the interleaved program performs increasingly better as M 
gets larger. 


This second program requires more off-chip logic, since the status outputs at ClK 3 
and 4 must be saved separately off-chip for use at ClK 5 and 6, respectively. This 
problem can easily be avoided by starting the calculations on the second pair of vectors 
two cycles later than shown (i.e., at ClK 4). The time necessary to perform the vector 
MAX operation on M pairs of N-dimensional vectors, two pairs concurrently, then 
increases to [6 x Nx CEILlNG(M/2) + 4] cycles. 


Data transfers and operations for the odd lines only are summarized in the program 
listing in Table 25. The complete program is obtained by repeating the equivalent of 


. each odd-numbered line in the next even line for the second pair of vectors. 


Table 25. Program Listing for Pipelined Single-Precision Interleaved 
Vector MAXIMIN 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 


1. 
LOAD RA, RB 
COMPARE(RA,RB) 


3. 
LOAD RA 
ADD(RA,O) 


5. 
LOAD RA; 
Y+-P/S 
ADD(RA,O) 
MULT(S,1) 


Unpiped Mode 


Table 26 shows the data flow for an unpiped vector MAX operation. 


CHAIN 
N 
Y 
Y 
N 
Y 
Y 
N 
Y 
Y 


RA 
A1 
A1 
B1 
A2 
A2 
B2 
A3 
A3 
B3 


RB 
B1 
B2 
B3 


S 
A1 
B1 
A2 
B2 
A3 


P 
A1 
A2 


C 


Y 
Y1 
y2 


STATUS 
A>B 
A>B 
A>B 


CLK 
1 
2 
3 
4 
5 
6 
7 
8 
9 


The status bit is saved off-chip at ClK = 2, 5, etc., and used at ClK = 3, 6, etc., 
as the 16bit of the instruction. 16selects either the multiplier or AlU result to output 
to the Y bus at ClK 
= 4, 7, etc. 


The program computes the vector comparison of M pairs of vectors of length N in 
[3 x M x (N + 1II cycles. 


Comparison of Pipelined and Unpiped Operation 


Pipelined operation is faster if: 


[6 x Nx CEILlNG(M/21 + 21/Fp < (3 x M x N + 11/Fu, 


where Fp and Fu are the clock rates in pipelined and unpiped modes, respectively. 
As of publication, pipelined mode provides faster performance for M > 1. 


The list maximin operations select the maximum or minimum value, Z, of a list of N 
elements. Hence, for input vector A with N components and output Z, 


list minimax is an essential operation in computer graphics because it is used to find 
the "extents" 
of a polygon or polyhedron. The extents are the maximum values of 
X, Y, and Z among the list of vertices for the object in question. Many forms of 
comparison are possible since the absolute value of either or both AlU operands may 
be employed. However, the example in this section assumes that the largest element 
of a list of N elements is desired. 


Pipelined Mode 


Table 27 shows the data flow for a pipelined list MAX operation, 
where M1 = MAX(A1, A2); Mi = MAX[Mi -1), 
A(i + 1)1. 2 :5 i :5 N - 
2. 


CHAIN 
y 
N 
Y 
Y 
Y 
N 
y 
y 
y 
N 
y 
y 
y 
y 
y 
y 


RA 
A1 
A1 
A2 
A3 
A3 
A4 
A4 
RB 
A2 
S 
A1 
A2 
M1 
M2 
M3 
p 
A1 
A3 
A4 
C 
M1 
M1 
M2 
M2 
M3 
y 
M3 
STATUS 
A>B 
A>B 
A>B 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 


As with vector comparison, the maximin of the absolute values is available, since the 
chip operates in independent AlU mode on the comparison steps. The comparison 
is between the RA register and the RS register in step 2 and between RA and C in 
steps 6, 10, etc. In these steps, the chip is switched into unchained, independent AlU 
mode. The status is saved off-chip and used to set the SRCC signal, which selects 
whether the P or S data goes into the C register in steps 5, 9, etc. 


When the list max is in the C register, at ClK = 4N - 2, the C register contents must 
then be passed through one of the functional units to the output. The MAXIMIN of 
an N-element list therefore takes 4N cycles. M such vectors can be processed in 
[M(4N - 
1) + 1] cycles. 


Data transfers and operations for the list max operation are summarized in the program 
listing in Table 28. The program is carried out in pipelined mode, alternating between 
unchained and chained modes. The list max reaches the output in cycle 4N. 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 
1. 
LOAD RA 
ADD(RA,O) 


2. 
LOAD RA, RB 
COMPARE(RA,RB) 


3. 
LOAD RA 
ADD(RA,O) 
MULT(S,1) 


4. 


5. 
C-P/S 
6. 
LOAD RA 
COMPARE(RA,C) 
7. 
LOAD RA 
ADD(C,O) 
MULT(RA,1) 
8. 


9. 
C-P/S 


REPEAT STEPS 6 THROUGH 9 UNTIL STEP 4N - 
2 IS REACHED, THEN: 


4N - 
2 
Y-S 
I ADD(C,O) 
I 


The equivalent unpiped programtakes [M(3N - 
1)+ 1] cycles. Pipelinedmode is fastest 
if: 


[M(4N - 
1) + 1l1Fp < [M(3N - 
1) + 1l1Fu, 


where Fp and Fu are the clock rates in pipelined and unpiped modes, respectively. 
As of publication, pipelined mode provides faster performance for all M and N. 


This section summarizes the concepts related to creating a three-dimensional image 
and examines a few of the matrix operations used in three-dimensional graphics 
processing. 
These operations 
include 
coordinate 
transformations 
and clipping 
operations. Additionally, this section illustrates some of the programming techniques 
used to perform these operations. 


Conceptually, translating 3-D images to 2-D display screens involves defining a view 
volume that limits the scope of the vista the viewer can see at one time. For simplicity, 
a standardized frame of reference, in which the viewer's eye is located at the origin 
of the coordinate system, is adopted in this example. 


As illustrated in Figures 4(a) and 4(b). the arbitrary world coordinates of the objects 
under scrutiny are transformed into normalized "viewing" 
or "eye" coordinates that 
reflect this frame of reference. Once the normalizing transformation is complete, the 
images within the view volume are projected onto a 2-D view plane, which is assumed 
to be located, like a projection screen, at a suitable relative distance from the viewer 
(see Figures 4(c) and 5). 


A basic model for creating a 3-D view, illustrated in Figure 6(a). transforms arbitrary 
world coordinates to normalized viewing coordinates and then "clips" 
the image to 
remove lines that do not fall within the normalized view volume. Clipping is followed 
by projecting the image to the 2-D projection plane (or "window"). 
The image is then 
mapped onto a canonical 2-D viewport display and from there onto the physical device. 


To 
incorporate 
image transformations, 
another 
model 
must 
be adapted 
[see 
Figure 6(b)]. After clipping, instead of projecting to the view plane, a perspective 
transformation is performed on the clipped viewing coordinates, transforming the view 
volume into a 3-D viewport, the "screen system" 
in which image transforms are 
performed. Then the image is projected to the 2-D viewport display and onto the 
physical device. 


In both models, the clipping operation is performed on coordinates in the viewing 
system. This approach is referred to as "clipping in the eye system." 
In practice, 
clipping is often performed after transformation 
to the screen system. A trivial 
accept/reject test is performed on viewing coordinates, the image is transformed to 
the screen system, and then clipping is performed. 


NOTES: A. In a sequence of transformations, the world coordinate positions for the house are transformed 
into the normalized viewing coordinate system (also called the eye system). For clarity, the house 
is pictured outside the view column. Also shown are the direction vectors VUP (view up), VPN 
(view normal). and VUP' (the projection of VUP parallel to VUN onto the view plane. 


B. After a series of translations, rotations, and shearing and scaling operations, the view volume 
becomes the canonical perspective projection view volume, which is a truncated pyramid with 
apex at the origin, and the house has been transformed from the world to the viewing coordinate 
system. 
C. This figure illustrates the projection of the house from the perspective of the viewer, with eye 
located at the origin of the coordinate system. 


J. D. Foley and A. Van Dam, Fundamentals of Interactive Computer Graphics, Addison-Wesley Publishing 
Company, Reading, MA, 1982,291-293. 
Reprinted with permission. 


The following 
sections illustrate programming techniques used in both of these 
approaches to normalizing, clipping, and transforming a 3-D image. The operations 
are grouped as "3-D Coordinate Transforms," 
"Clipping in the Eye System," 
and 
"Clipping in the Screen System." 


PROJECTION 
PLANE 


VIEWING 
IEYE) 
COORDINATE 
SYSTEM 


Adapted with permission from a paper by Stephen R. Black entitled "Digital Processing of 3-D Data to Generate 
Interactive Real-Time Dynamic Pictures" from Volume 120 of the 1977 SPIE journal "Three Dimensional 
Imaging. " 
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Figure 6(a) Model of Procedure for Creating a 3-D Graphic 
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One of the computationally-intensive 
functions of a 3-D computer graphics system 
is that of transforming points within the object space, such as translating an object 
or rotating an object about an arbitrary axis. Equally complex is the transformation 
of points within the object space (or" world coordinate system") 
into points defined 
by a particular perspective and located within the viewing space (or" eye coordinate 
system"). This latter process, known as the viewing transformation, generates points 
in a left-handed cartesian system with the eye at the origin and the z-axis pointing 
in the direction of view. The arbitrary world-system view volume and the objects therein 
are translated, rotated, sheared, and scaled to match the predefined, canonical view 
volume of the eye system. 


For a "realistic" 
image, the canonical view volume will be a truncated pyramid that 
mimics the cone of vision available to the human eye. Alternatively, 
the volume can 
be a unit cube. The series of operations that make up each transformation 
differ, but 
if homogeneous coordinates are used, either transformation 
can be expressed as a 
simple matrix multiply. 


For each point (X, Y, Z) in the world system, a projection in homogeneous coordinates 
is denoted by (Xh, Yh, Zh, Wh) where, 


(Xh, Yh, Zh, Wh) = (X x Wh, Y x Wh, Z x Wh, Wh). 


and Wh is simply a scale factor, typically unity when floating-point numbers are used. 
(With fixed point values, nonunity valuesof Wh areusedto maximizeuseof the numeric 
range.) To transform a point in homogeneous coordinates, it is post-multiplied by a 
4 x 4 transform matrix: 


[Xh',Yh',Zh',Wh'] 
= [Xh,Yh,Zh,Wh]X 
[A11 A12A13A14~ 
A21 A22 A23 A24 
A31 A32 A33 A34 


. 
A41 A42 A43 A44 


The transform matrix is constructed by multiplying together a sequence of matrices, 
each of which performs a simple task. The product of 4 or 5 elementary matrices may 
be used to perform some complex overall operation on a set of points representing 
an object or an entire scene. Once constructed, the transform matrix is used on each 
point of the object to be transformed. 


This section describes two approaches to the viewing transformation-the 
general 
case and the specific yet typical case in which a reduced version of the transform 
matrix may be used. Performance times are given for 30-MHz frequencies, which 
roughly correspond to the operating speeds of the' ACT884 7. 


Table 29 shows part of the data flow for the pipelined and chained program for the 
product of the homogeneous point [X, Y, Z, W] and the 4 x 4 transform matrix A. 


Table 29. Partial Data Flow for Product of lX, Y, Z, W] and 
General Transform Matrix 


RA 
X 
Y 
Z 
W 
X 
Y 
Z 
W 
X 
Y 


RB 
A 11 
A21 
A31 
A41 
A12 
A22 
A32 
A42 
A13 
A23 


5 
51 (1) 
53(1 ) 
54(1) 
51 (2) 
T1 


P 
P1(1) 
P2(1 ) 
P3(1) 
P4(1 ) 
P1(2) 
P2(2) 
P3(2) 
P4(2) 
C 
P2(1 ) 
P2(1 ) 
53(1 ) 
P2(2) 
P2(2) 


Y 
X' 


ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


The technique is that already illustrated for the sum of products operation. The numbers 
in parentheses 
indicate 
which 
column of the transform 
matrix 
is involved 
in the 
operation. Here, P1(i) = XxA1i, 
P2(i) = YxA2i, 
etc. 51(i) = P1(j) + 0, 53(i) = 51(j) 
+ P3(i), 54(i) = P2(j) + P4(j), and Ti = 53(j) + 54(j). T1 = X', T2 = y', T3 = Z', 
T4 = W'. As in the sum of products illustration, 
in order to make the most efficient 
use of the 5 register, P2 is used directly 
instead of summing by 0 to form 52. 


The time to transform 
N points in a system is 16N + 6 cycles. The system can 
transform 
approximately 
1.875 million points per second at a clock rate of 30 MHz. 


Because viewing transformations 
are frequently carried out using a single-vanishing- 
point perspective, the 3 x 1 column t,hat performs perspective transformations 
with 
multiple vanishing points is often not used. Additionally, 
with Wh = 1, the 1 x 1 scale 
factor is often equal to one. In these cases, the transform matrix takes the following 
form: 


[ 


... o~ 
... 
0 
... 
0 


.... 
1 


With multiple vanishing points, and in other graphics operations such as clipping, 4 x 4 
matrices are used with nonzero values in the fourth column. The transform 
matrix is 
termed "reduced" 
when its fourth column is the same as that previously shown. In 
such cases, the transform of each point requires only 9 multiplications and 9 additions. 


Table 30. Partial Data Flow for Product of [X, Y, Z, W] and Reduced 
Transform 
Matrix 


RA 
X 
y 
Z 
x 
X 
Y 
Z 
x 
X 


RS 
All 
A21 
A31 
A41 
A12 
A22 
A32 
A42 
A13 


5 
51 (1) 
52(1 ) 
T1 


P 
Pl (1) 
P2(l) 
P3(l) 
Pl (2) 
P2(2) 
P3(2) 


Pl (1) 
P2(l) 
51 (1) 
Pl (2) 
P2(2) 
y 
X' 
elK 
1 
2 
3 
4 
5 
6 
7 
8 
9 


Again, the numbers in parentheses refer to the column of the transform matrix involved 
in the operation. In this case, however, only the first three columns are used. Hence, 
for 1 :s i :s3, P1(i) = X x A 1i, P2(i) = Y x A2i, etc. 51 (j) = P1(j) + A4i, 52(i) = P2(j) 
+ P3(j), and Ti = 51(i) + 52(j). T1 = X', T2 = Y', T3 = Z'. Note that W values 
are not calculated since they are all 1. 


The time to transform N points in a system is (12N + 5) cycles. The system can 
transform 2.5 million points per second at 30 MHz. 


Once an image is transformed into viewing coordinates, it must be clipped so that 
lines extending outside the view volume are removed. There are several approaches 
to clipping, some more efficient than others. This section surveys the most commonly 
used techniques and estimates the throughput of several single- and multi-processor 
arrangements. 


First considered is the technique of fully clipping the line segments to fit within the 
viewing pyramid in the eye coordinate system. This technique is commonly referred 
to as "clipping before division." 


Clipping in the screen system is considered second. This method eliminates lines that 
are obviously invisible in the eye system 


If an object is composed of straight line segments and a perspective view is to be 
taken, the viewing volume is a pyramid defined by the following plane equations: 


where K is a constant to be defined below. Thus, - KZ < (X,Y) < KZ. Two other 
clipping planes are usually employed at Z = Nand Z = F, where Nand F are the near 
and far limits, respectively, of the view. This gives: 


Looking in the direction of the z-axis (see Figure 78), the eye can imagine a screen 
located at a distance N from the eye. K is formed from the half-screen height divided 
by N. A specific line segment might intersect any or all of the six clipping planes. One 
common approach to this problem is to use six processors in a pipeline, each clipping 
the line to one plane. 


IY 
- 
k, plana 


k -(~) 


FAR VIEWING 


LIMIT 


Consider the case of clipping the line defined by the points P1 = (X1, Y1, Z1) and 
P2 = (X2, Y2, Z2) against the Z = N plane. First computed are (Z1 - 
N) and 
(Z2 - 
N). If both are negative, the line is invisible, and a notation meaning an empty 
line is passed on. If both are positive, both ends of the line are on the visible side of 
the Z = N plane, and the line is passed on unclipped. 


When one of these computed values is negative and the other positive, the line must 
be clipped and the new values for its endpoints passed down the rest of the pipeline. 
To do so, a parameter t that indicates what fraction of a segment Z1Z2, and therefore 
of P1P2 as a whole, lies on the P1 side of the Z = N plane, is computed as follows: 


In general, the value of the parameter is derived as described in Newman and Sproull1, 
using the following equations of the line: X = X 1+ (X2 - X1)u; Y = Y1+ (Y2 - Y1)u; 
Z = Z1 + (Z2 - Z1)u. These equations are each inserted into the corresponding plane 
equation. In the current example, N = Z1 + (Z2 - 
Z1It. 


Since N is between Z1 and Z2, t is always positive, and the signs of Z1 - 
Nand 
Z2 - 
N are used to determine which end to clip. If Z1 - 
N is negative, the P1 end 
is clipped, using the value of t to determine the delta in X1 and Y1. The coordinates 
for the new endpoint of the shortened line segment are given by: 


An alternative to clipping to one plane at a time entails clipping to all six planes at 
once. Both approaches are examined in the following sections. 


Clipping to One Plane at a Time 


When a pipeline of six processors is used, each clipping the same line to one plane, 
each processor must wait for data from the previous processor and hold its solution 
until the next processor is ready to receive it. There is no reason to seek shortcuts 
through the computations by including branches in the program becausethere is little 
point in one of the processors completing its task earlier than the rest. This statement 
is true whether the six processors are driven from the same or from separate 
sequencers. Similarly, 
operating the 
pipeline asynchronously 
buys 
little 
time. 
Synchronous operation in the case of a clipping pipeline is likely to be almost as fast 
as, and much simpler and cheaper than, asynchronous operation. 


Because shortcuts are not beneficial, the program can be written 
assuming the 
maximum amount of work will be required at each stage, whether the line requires 
clipping at that stage or not. If it is assumedthat invisiblelinesarecaught and eliminated 
as a separate, initial computation, branchesfrom the clipping pipelinecan be eliminated 
entirely. An alternative approach, in which brancheswould be beneficial, involves using 
two, 
three, or more 'ACT884 7 chips in parallel, rather than as a pipeline, each 
performing all six stages of clipping for individual lines. The program lends itself to 
this approach because the computations in each stage of the clipping pipeline are 
identical. 


The method for clipping a line segment against the Z = N plane as one stage in a 
clipping pipeline, assuming invisible lines have been previously eliminated, will be 
illustrated. Two t values are computed tl 
for clipping the Pl end of the line segment 
and t2 for clipping the P2 end. If Zl < N, tl 
= (Zl 
- 
N)/(Zl 
- 
Z2);otherwise, tl 
= O. If Z2 < N, t2 = (Z2 - 
N)/(Zl 
- 
Z2); otherwise, t2 = O. The new endpoints 
for the line segment are computed as follows: 


Xl' 
Xl 
+ (X2 
Xl) 
x tl, 
Yl ' 
Yl 
+ (Y2 
Yl) 
x tl, 
Zl ' 
Zl 
+ (Z2 
Zl ) x tl 


X2' 
X2 
(X2 
Xl) 
x t2, 


Y2' 
Y2 
(Y2 
Yl) 
x t2, 


Z2' 
Z2 
(Z2 
Zl ) x t2· 


Note that the denominator is the same in the equations for tl andt2; it is this reciprocal 
computation that is expensive in time. In the' ACT884 7, the built-in divide is very fast. 


t1 
and t2 
are calculated 
by 
step 
18, 
and the 
entire 
operation 
completes 
in step 27. The data flow is shown in Table 31. A six-processor 'ACT8847 system 
operating at 30 MHz would clip 1.2 million line segments per second with a new 
operation beginning every 25 cycles. 


Table 31. Data Flow for Clipping a Line Segment Against the Z -= N Plane 
Using the SN74ACT8847 


RA 
Z1 
Z1 
Z2 
X2 
0.5 
Y2 
H1' 
H2' 


RS 
Z2 
N 
N 
X1 
d 
SAME 
AS FOR 


X2- 
Y2- 
'8837 
S 
d 
H1 
H2 
H1' 
H2' 
X1 
Y1 


P 
1/D 
t1 
t2 


C 
H1 
H2 
1/D 
t1 
STEPS 


X2- 
Y2- 
20 
Y 
d 
H1' 
H2' 
X1 
Y1 
THRU 
STATUS 
28 
ClK 
1 
2 
3 
4 
5 
6 
7 
8 
14 
15 
16 
17 
18 


Since the performance levels obtained from the six-chip systems described below are 
slower than the rate of endpoint transformation by a single-chip system, some further 
speed improvement is desirable.Hence, rather than going through the code for clipping 
to the X and Y planes, another approach is proposed. 


The "window edge clipping method" derived in Newman and Sproull can be used to 
clip to all six planes at once. Recall that the viewing volume for a perspective view 
is a pyramid defined by the following plane equations: 


where K = SIN, as defined in a previous section. Given a segment with endpoints 
P1 = (X1, Y1, Z1) and P2 = (X2, Y2, Z2), to perform the entire clipping operation 
on all six planes at once, the following two six-tuples must be computed: 


0=(W1 
+X1,W1-X1,W1 
+Y1,W1-Y1,Z1-N,F-Z1)=(01,02, 
1. 


R= (W2 +X2,W2 -X2,W2 
+ Y2,W2 - Y2,Z2 -N,F -Z2) = (R1,R2, 
), 


where W1 = KZZ1 and W2 = KZ2. 


Consider the case where X1 < W1. Then, W1 +X1 < 0; i.e., 01 < O. In general, 
a negative element of 0 indicates that P1 is on the invisible side of one of the clipping 
planes, while a negative element of R indicates the same for P2. To clip the line, the 
six parameters ti for clipping the P1 end and the six parameters si for clipping the P2 
end are computed. Here, ti20 = 200i/(Oi 
- 
Ri) and si = Ri/(Ri - 
Oil. (Again, the 
equations of the line as described in Newman and Sproull are used). 


For example, to find the value t1 for clipping P1 to the X = - W 
following equation is used: 


t1 
= (X1 + W1 )/[(X1 
+ W1) - (X2 + W2)) 


In general, ti = 0i/(Oi 
- 
Ri)· Similarly, si = Ri/(Ri- Oil. 


To actually carry out the computations of ti and si, the trick discussed above is 
performed, and each element of 0 and R is replaced with the difference of the element 
and its absolute value, to form 0' and R'. That is, 


OJ' 
2 x 0i if 0i < 0, and Oi' = 0 otherwise. 
Ri' 
2 x Ri if Ri < 0, and Ri' = 0 otherwise. 


Next calculated is ti 
= 
0i'/[2(Oi 
- 
Ril) and si 
= 
Rj'/[2(Ri - 0i)). 
followed 
by 
T1 = MAX(til and T2 = 1 - 
MAX(Si). The P1 end is clipped using T1 and the P2 
end is clipped using T2. 


In an 'ACT884 7 three-processor parallel system, in which each processor is given the 
task of computing two ti and two si values, computing the 0i' and Ri' values takes 
14 cycles, with the values of 0i - 
Ri computed by step 13. The six divides, 0.5/(Oi 
- 
Ri!. are completed in step 30, assuming an 8-bit seed ROM is used. The maximin 
operations take place in parallel in two processors and complete at step 54 (24 + 
30!. and the new endpoints are ready by step 60 (6 + 54). 


The data flow and program listing for computing t1, t2, s1, and s2 by one of the three 
'ACT8847 
processors is given in Tables 32 and 33. 


CHAIN 
Y 
Y 
Y 
Y 
Y 
N 
N 
N 
Y 
Y 


RA 
K 
K 
W2 
Q1 


RB 
21 
22 
X1 
X1 
X2 
X2 
R1 


S 
Q1 
Q2 
R1 
Q1' 
Q2' 
R1 ' 


P 
W1 
W2 


C 
W1 
W2 
Q1 
Q2 
R1 


Y 
Q1' 
Q2' 
R1 ' 


STATUS 


ClK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 


CHAIN 
y 
N 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 


RA 
Q2 
RO 


RB 
R2 
d1 


S 
R2 
Q1 -R1 
Q2-R2 
R2' 
RO 
TO 


P 
dxRO 
RO 
R1 


C 
R2 


Y 
Q1 -R1 
Q2-R2 
R2' 


STATUS 


ClK 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 


CHAIN 
y 
y 
y 
y 
y 
y 
y 
y 
N 
N 


RA 
Q1' 
Q2' 
R1 ' 


RB 
o-s 
R2' 


S 
R1 
T1 
1/02 


P 
d x R1 
0- SR1 
1/01 
1/02 
t1 
t2 
S1 
S2 


C 
1/01 
1/01 


Y 
t1 
t2 
S1 
S2 


STATUS 


ClK 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 


NOTE: Cycles 13, 15, 17, 19, ... 
,25 compute 
1/D1 = 0.5/d1; 
Cycles 14, 16, 18,20, 
... 
,26 compute 
1/D2 = 0.5/d2, 
di = Oi-Ri. 


Table 33. Program listing for Three-ProcessorClip to Compute t1, t2, s1, 
and s2 Only 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 


1. 
LOAD RA, RB 
MUL T (RA,RB) 


2. 
LOAD RA, RB 
MUL T (RA,RB) 


3. 
LOAD RB 
C-P 
ADD (P,RB) 


4. 
LOAD RB 
C-P 
ADD (C, -RB) 


5. 
LOAD RB 
C-S 
ADD (C,RB) 


6. 
Y-S 
C-S 
ADD (C,-ICIl 


7. 
Y-S 
C-S 
ADD (C,-ICI 


8. 
Y-S 
ADD (C,-ICII 
9. 
LOAD RA, RB 
ADD (RA, - RB) 


10. 
LOAD RA, RB 
Y-S 
ADD (RA, - RB) 


11 . 
LOAD RB, RB 
Y-S 
C-S 
ADD (RA, - RB) 


12. 
Y-S 
ADD (C,-ICI) 


CODE FOR TWO DIVISIONS 


25. 
LOAD RA 
Y-S 
C-P 
MULT (RA,P) 


26. 
LOAD RA 
Y-S 
ADD (P,Ol 
MULT (RA,P) 


27. 
Y-S 
MULT (RA,C) 


28. 
Y-S 
MULT (S,RBI 


This approach facilitates the transform of 576,000 line segments in a 3-chip 'ACT8847 
system running at 30 MHz. If branches are permitted in the sequencer, a considerable 
speedup is available for situations in which a large proportion of line segments are 
either invisible, and may be eliminated, or are completely visible, and may be passed 
without clipping. A single-processor system takes no more than 32 cycles, sometimes 
as few as 10 cycles, to reject an invisible line, whereas it takes 91 cycles to process 
lines that need both ends clipped. Hence, in a situation where 50% of the line segments 
are invisible, the speed is in excess of 540,000 
segments/second at 30 MHz. It is 
not uncommon for 80% of linesto be invisible, in which case, the speedwould increase 
to 877,000 
line segments at 30 MHz. 


To take advantage of this speedup, the only change in the sequence given above is 
that while computing Q and R, the logical AND and OR is formed for the signs of the 
corresponding pairs of values, Qi and Ri. 


Sign 0i 
Sign Ri 
Sign A - 
-OjX 
IRil 
Sign B - IOil xRj 
A>B 
A- 
B 
- 
- 
+ 
- 
T 
F 
- 
+ 
+ 
+ 
F 
T 


+ 
- 
- 
- 
F 
T 


+ 
+ 
- 
+ 
F 
F 


The A > B status provides the needed AND function of the sign bits of 0i and Ri . 
In computing these A > B values, if A > B is TRUE,the sequencer branches to code 
that rejects the line as invisible. A comparison A > B of A = (Oi x IRiI) and B = 
( I 0i I x Ri) gives the logical AND of the complement of the sign bits. It is TRUEwhen 
both 0i and Ri are positive. If all six values are TRUE, the sequencer can branch to 
code that passes the line segment unclipped. 


For a three-processor parallel system, lockstep operation with a single sequencer is 
still possible since all three processors are working on the same line segment, and 
the branch options apply equally to them all. The estimated time for a three-processor 
system is 56 cycles. 


Now that the operations have been reduced to a minimum, the remaining steps are 
necessarilysequential. Rejectinginvisible or passingtotally visible line segments without 
division, however, is still beneficial. 


In most graphics systems, full line clipping is not performed in the eye system. Instead, 
a trivial accept/reject test is performed, in which the line segments are simply tested 
against the six clipping planes. If a line has both ends on the invisible side of anyone 
of the clipping planes, it is rejected. Lines surviving this test may still be outside the 
viewing pyramid. In any case, the lines aretransformed to the screen coordinate system 
and then clipped against a cube defined by the simple plane equations - 1 < (X, Y, 
Z) < 1. The next three sections describe this process. 


Trivial Accept/Reject Test 


In the eye system, the clipping planes are: 


where W = Kx Z. After 
- W1 and - W2 are computed, a sequence of comparison 
operations are performed, summarized as follows: 


with X1 in RS and -W1 
in P, 
with X1 in RA and - W1 in e, 
with Y1 in RS and - W1 in e, 
with Y1 in RA, 
with Z1 in RS and n in RA, 
with Z1 in RA and F in RS, 


P > RS (i.e., -W1 
> X1) 
RA > lei 
(i.e., X1 > W1) 
e > RS 
RA > Ie I comparison 
RA > RS (i.e. N > Z1) 
RA > RS (i.e. Z1 > F). 


These six operations are carried out in successive cycles and then repeated for (X2, 
Y2, Z2). The two six-tuples are saved off-chip and a bit-wise AND is carried out. If 
anyone of the resulting six boolean values is TRUE, the line is rejected. This entire 
operation takes only 16 cycles, thereby providing a speed of 2,143,000 
line segments 
per second at 30 MHz. The data flow for an accept/reject test is given in Table 35. 
Accept/reject 
testing of individual points takes only 8 cycles. 


CHAIN 
N 
N 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
Y 
N 
N 


RA 
K 
K 
X1 
Y1 
N 
Z1 
-W2 
X2 
-W2 
Y1 
N 
Z2 


RB 
Z1 
Z2 
X1 
Y1 
Z1 
F 
X2 
-W2 
Y1 
-W2 
Z2 
F 


S 


P 
-W1 
-W2 


C 
-W1 
-W1 
-W1 
-W1 


Y 
-W2 


-W1 
-W1 
-W2 
-W2 
STATUS 
X1>W1 
Y1>W1 
N>Z1 
Z1>F 
X2>W2 
Y2>W2 
N>Z2 
Z2>F 
>X1 
>Y1 
>X2 
>Y2 


CLK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 


After the line segments have passedthe trivial accept/reject test, they are transformed 
to the screen coordinate system. The following transformation is first applied to the 
Z coordinate in order to scale its clipping planes to Z' 
= 
- W, and Z' 
= W: 


The value of 1/(F - 
N) is constant for all line segments and is therefore computed 
only once. In fact, two constants, a = 2K/(F - N) and b = (F + N)/2, can be available 
so that Z' = Z x ax (b + Z). (Note that other transformations on Z can also be used.) 


After the trivial accept/reject test, the following transformation to the screen system 
occurs: 


Z1' and Z2' can be formed in 8 cycles. Only two reciprocals, 1/W1 and 1/W2, need 
to be computed, and they can be interleaved and completed in 12 cycles in an '8847. 
The line segment is transformed to the screen system in a further 6 cycles. The total 
is 26 cycles for the' ACT884 7. A single-processor system would transform 1.2 million 
line segments per second at 30 MHz. 


Note that the above projection does not preserve planarity. See Newman and Sproull 
for perspective projections that do preserve planes. 


The Clipping Operation 


The final operation on line segments is to clip them to the cube: 


It is important to realize that the required resolution of Xs, Ysand Zs may only be 
10 or 11 bits. 


Consider a three-processor pipeline, with each processor clipping against two parallel 
planes. The first will clip against the x planes - 1 < X < 1. For clipping the P1 end 
of the line segment, 0 
= (1 + X1, 1 - X 1) is computed and 0' is formed, where 
OJ' = 0i - 
I0i I. I.e., 


0" 
= 2(1 '+ X1l. if (1 + X1) < 0; 01' 
= 0 otherwise. 
02' 
= 2(1 - 
X1l. if (1 - 
X1) < 0; 02' 
= 0 otherwise. 


At least one of OJ' will be zero; the other will be negative. Hence, MIN(01', 
02) 
01' + 02' 
= [(1 + X1) - 
!1 + X1!l + [(1 - X1) - 
11-X1!l. 
Therefore,MIN(O,., 
02') 
= (1 - 
IX11) - 
11 - 
!X111·So, t = l(m1-!m1!) 
/ 2dl and s = !(m2-lm21) 
/ 2d I, where mi = 1 - 
IXi 1m and d = X1 - 
X2. Note that only one reciprocal is 
required per processor. 


A three-processor parallel system would have each processor work on one dimension, 
supplying its pair of max parameters to a "second stage." The second stage would 
receive (tx, sxl. (ty, Syl. (tz, sz) from the above system, compute max(t) = T and 
maxIs) = 5, and then clip line as before: 


X1' 
= X1 + (X2 - 
X1)T, 
X2' 
= X2 - 
(X2 - 
X1)S. 


The data flow and program listing for the program run by a processor working on the 
X dimension are given in Tables 36 and 37. 


CHAIN 
y 
N 
Y 
Y 
N 
Y 
N 
Y 
Y 
Y 
Y 
Y 
Y 
Y 


RA 
X1 
I 
I 
RO 
d 


RS 
X2 
X1 
X2 
d 
0.5 


S 
d 
m1 
m2 
RO 
n1 
TO 
n2 
R1 
T1 


P 
dxRO 
RO 
R1 
dx R1 
0.5R1 


C 
m1 
m2 
m2 
y 
d 
n1 
n2 


STATUS 


CLK 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 


CHAIN 
y 
N 
N 
Y 
Y 


RA 
n1 
n2 


RS 


S 


P 
1/0 
t 
s 


C 
1/0 


Y 
t 
s 
STATUS 


CLK 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 


MULTIPLIER 
REGISTER TRANSFERS 
ALU OPERATION 
OPERATION 
1. 
LOAD RA, RB 
Y-S 
ADD (RA, - RB) 


2. 
LOAD RA, RB 
ADD (RA, - RB) 


3. 
LOAD RA, RB 
ADD (RA, - RB) 


4. 
LOAD RA, RB 
ADD (RA,O) 
MUL T (RA,RB) 


5. 
Y-S 
ADD (C,-ICll 


6. 
ADD (2, -Pl 
MULT (S,1) 


7. 
Y-S 
ADD (C, -ICIl 


8. 
MULT (S,P) 


9. 


10. 
LOAD RA 
ADD (P,O) 
MULT (RA,P) 
11. 


12. 
LOAD RB 
ADD (2, -P) 
MULT (S,RB) 


13. 


14. 
MULT (S,P) 


15. 


16. 
LOAD RA 
Y-P 
MULT (RA,P) 


17. 
LOAD RA 
Y-P 
MULT (RA,P) 


18. 


19. 


The three-processor parallel clipping system operates on a fixed loop of 17 instructions 
and can therefore clip 1.76 million line segments per segment at 30 MHz. The second 
stage could not keepup with this rate without being implemented as several processors. 
A single processor can form the two max values in 23 cycles (a loop of 21 cycles) 
while two processors would take only 12 cycles (a loop of 10). The final clipping of 
the two endpoints takes about 11 cycles (a loop of 9 cycles). 


To summarize, the fastest clipping system operates in the normalized screen coordinate 
system. It has six processors arranged in three stages - 
a three-processor parallel 
system with each processor working on each dimension and a single-processor third 
stage to clip the endpoints. The combined speed would be equal to that of the first 
stage, as previously described. A slightly slower four-processor system would use one 
processor for computing the two max values in the second stage. 


The previous section consideredseveralapproachesto the designof computer graphics 
systems based on the' ACT884 7. Table 38 summarizes the results. Table 39 shows 
the options available in combining the sub-systems listed in Table 38 into a design 
for a graphics system. 


SUB-SYSTEM 
SPEED AT 30 MHz 


a Transform, 4 x 4 matrix, 
1 'ACT884 7 cycle 
1.875 M points/s 


b Transform, 3 x 3 matrix, 
1 'ACT884 7 cycle 
2.5 M points/s 


c Eye clipping pipe, 
6 'ACT884 7 cycles 
1.2 M lines/s 


d Eye clipping 
3 'ACT884 7 cycles 
0.576 
M lines/s 


e Eye Accept/Reject test 
1 'ACT884 7 cycle 
2.143 M lines/s 


f 
Screen clipping 
5 'ACT884 7 cycles 
1.76 M lines/s 


g Screen clipping 
4 'ACT884 7 cycles 
1.42 M lines/s 


SUB-SYSTEM 
SPEED AT 30 MHz 


I 
(a or b) + c, 
7 'ACT884 7 cycles 
1.2 M lines/s 


II (a or b) + d, 
2 'ACT884 7 cycles 
0.576 
M lines/s 


III (a or b) + f, 
6 'ACT884 7 cycles 
1.76 M lines/s 


IV 2 x (a or b) + c + g, 
7 'ACT884 7 cycles 
3.75 M lines/s 


In the fourth system, it is assumed that 2 processors are used for the transform of 
endpoints so as to balance the high clipping rate. It is also assumed that the 
accept/reject stage will eliminate more than 60% of the line segments so that the 
clipping system can keep up with the transform processors. 


SN74ACT8867 
32·81T VECTOR PROCESSOR UNIT 


• 
Fast Numeric Processor 
Optimized for 
• 
Graphics 
• 
Simultaneous 
Independent 
Operation of 
• 
MUltiplier and ALU 
• 
Onboard Six-Port Register File 
• 
- 
46 Words X 40 Bits 
• 
Operates In Pipelined or Flow-Through 
• 
Mode 


On-Board Floating-Point 
Seed ROM for 
Newton-Raphson 
Division 
and Square Root 


Three 32-Bit Ports Support 
High-Data 
Bandwidth 


Selectable 
Format Conversion 
From Fixed- 
to Floating-Point 
at Input Ports 


• 
62-ns Cycle Time In Fully Pipelined Mode 


description 


On a single chip the SN74ACT8867 
Vector 
Processor 
Unit (VPU) combines 
a multiplier, 
Arithmetic 
Logic Unit 
(ALU), and a six-port 
register 
file for floating-point 
and integer 
arithmetic. 
The VPU performs 
single-precision 
floating-point 
operations 
and fixed-point 
operations 
in single or double 
precision. 


Applications 
such as graphics 
workstations 
and high-end 
digital signal 
processors 
can make use of the speed 
and flexibility 
offered 
by the independent 
multiplier 
and ALU, which are separately 
programmable 
and operate 
concurrently. 
The register 
file can be double-pumped 
to input four operands 
within 
one clock period, 
matching 
the input 
bandwidth 
of the register 
file to the data 
bandwidth 
through 
the multiplier 
and ALU 
in concurrent 
operation. 


The 'ACT8867 
accepts 
operands 
from its two 32-bit 
input buses as 2s complement 
integers, 
2s complement 
fractions, 
or floating-point 
numbers 
in Digital 
Equipment 
Corporation 
(DEC) 
'F' format. 
Internally, 
the VPU 
supports 
40-bit floating-point 
arithmetic 
and 32-bit or 64-bit fixed-point 
arithmetic. 


The 'ACT8867 
has two 32-bit input data buses and a 32-bit output data bus (see functional 
block diagram). 
The 
device 
supports 
40-bit floating-point 
internally 
and also supports 
2s complement 
integer 
operations. 
Forty-six 
general-purpose 
registers 
allow intermediate 
results to be stored internally. Two other pseudo-register 
locations, 


when addressed, 
serve as a flow-through 
function 
in the register file. In this mode, the register 
file is bypassed, 
allowing 
data to be fed directly 
from an external 
source 
to the processing 
units. 


Format 
conversion 
logic is available 
to translate 
fixed-point 
inputs to floating-point 
format 
without 
tying up the 
ALU. Similarly, floating-point 
results can be translated 
to fixed-point 
outputs by using the logic elements 
provided 
for float-to-fixed 
conversion 
and binary-point 
adjustment. 


When all the main functional 
blocks are active simultaneously, 
the VPU can perform a multiply, an ALU operation, 
and three format 
conversions 
(two fixed-to-floating-point 
and one floating-to-fixed-point) 
within 
a single 
clock 


cycle. 


The instruction 
set for the 'ACT8867 
is partitioned 
so that the two processing 
units, the ALU processor 
and the 
multiplier 
processor, 
can be operated 
independently 
or in parallel. 


Single-precision 
floating-point 
Newton-Raphson 
division 
and square 
root are supported, 
along 
with 
limited 
support for double-precision 
arithmetic. 
A single-precision 
seed table for Newton-Raphson 
division 
and square 
root is provided 
(see Table 6). Appropriate 
status is generated 
for all instructions 
and data types. The VPU can 
be operated 
in flow-through 
mode, 
or pipeline 
registers'are 
available 
to allow 
operation 
in pipeline 
mode to 
increase 
system 
speed. 
All pipeline 
registers 
may be placed 
in a scan mode for device-level 
and board-level 
diagnostics. 


The device 
operates 
with a 62-ns 
system 
clock 
and 
is designed 
with five 
percent 
power 
supply 
tolerance 


requirements 
over the commercial 
temperature 
range 
(O°C to 70°C). 


Selectable Format Conversion 
From 
Floating- to Fixed-Point 
at Output Ports 


Low-Power 1-l-lm EPIC'· CMOS Process 


TEXAS ." 
INSTRUMENTS 


SN74ACT8867 
32-81T VECTOR PROCESSOR UNIT 


Several 
naming 
conventions 
are used throughout 
this document. 
All signals 
are active 
high unless 
otherwise 
noted. Signal transition 
characteristics 
are standard 
TIL 
levels with positive 
logic. Bus labels are given with the 
Least Significant 
Bit (LSB) designated 
as bit O. The Most Significant 
Bit (MSB) 
is designated 
as bit 39 or as 
appropriate 
for the bus width. 
In figures, 
the MSB is located 
at the left and the LSB is at the right. The register 
file is commonly 
referred 
to by the abbreviation 
RF and the multiplier 
is abbreviated 
as MULT. 


data flow 


The 'ACT8867 
VPU 
is a three-port 
device 
with independent 
multiplier 
and ALU blocks 
(see functional 
block 
diagram). 
Input and output 
functions 
are enhanced 
by data format 
conversion 
blocks 
which 
may operate 
in 
parallel 
with the arithmetic 
blocks. 


Each input data bus is fed to a fixed-to-float 
conversion 
block. These blocks contain 
registers 
to latch input data, 


if desired. 
These 
blocks can either pass data or convert 
integer or fractional 
values to floating-point 
format. The 
output of each conversion 
block is connected 
to a two-input 
mUltiplexer 
which 
selects 
data for the write buses 
of the register 
file. 


The register file has 46 locations, 
each of which is 40 bits wide. Two data operands 
may be written simultaneously 
through 
the two write inputs. 
Four read outputs 
are available 
to read any data value and output 
it to the inputs 
of the multiplier 
and ALU. 


Both inputs to the multiplier 
may receive 
data from the register file, the ALU, or feedback 
from itself. The A input 
may also select output from the seed ROM to start division or square-root 
operations. 
Data registers 
are provided 
on each 
input to the multiplier 
and the 
instruction 
bits are also 
registered. 
A single 
write 
enable 
and 
two 
flow-through 
pins control 
the function 
of these 
registers. 
The multiplier 
instructions 
support 
single-precision 
floating-point, 
integer, and multiple-precision 
fixed-point 
operations. 
Internal outputteedback 
may be registered 
or flow-through 
for output from the device. 


Both inputs to the ALU may receive 
data from the register 
file or feedback. 
The A input allows feedback 
from 
the multiplier 
and the B input allows feedback 
from the ALU. Data registers 
are provided 
on each input to the 
ALU and the instruction 
bits are also registered. 
A single 
write enable 
and two flow-through 
pins control 
the 


function 
of 
these 
registers. 
The 
ALU 
instructions 
support 
single-precision 
floating-point, 
integer, 


multiple-precision 
fixed-point, 
and logical 
operations. 
Output 
feedback 
may be registered 
or flow-through 
for 
output from the device. 


Results from the multiplier 
and the ALU may be passed from their output registers 
to the float-to-fixed 
conversion 
block. A multiplexer 
selects which unit supplies 
input to this block. This block can convert floating-point 
numbers 
to fixed-point 
values 
if the format 
conversion 
is selected. 
Rounding 
may also be optionally 
performed 
in this 
block. 
Parity is generated 
from the output 
of this block. 


Status 
is generated 
from the fixed-to-float, 
multiplier, 
ALU, and float-to-fixed 
blocks 
and may be latched 
in the 
status 
register. 
Portions 
of this status 
may be selected 
for output 
on the six status 
pins as needed. 


Status 
pins, Y output 
pins, and parity pins are all buffered 
for output. 
The buffers 
can set the output 
pins to a 
high-impedance 
state. 
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'ACT8867 
package 
information 


Pin descriptions and grid assignments for the 'ACT8867 are given on the following pages. Pin number A1 is 
marked with a square for indexing purposes. 


208-PIN 
GA PACKAGE 
(TOP VIEW) 


2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 


A 
[!J • • • • • • • • • • • • • • • • 


B • • • • • • • • • • • • • • • • • 
c • • • • • • • • • • • • • • • • • 


0 • • • • • • • • • • • 
• • • • • 


E • • • • 
• • • • 


F • • • • 
• • • • 


G • • • • 
• • • • 


H • • • • 
• • • • 
J • • • • 
• • • • 


K • • • • 
• • • • 


L • • • • 
• • • • 


M • • • • 
• • • • 


N • • • • 
• • • • 
p • • • • • • • • • • • • • • • • • 


R • • • • • • • • • • • • • • • • • 
S • • • • • • • • • • • • • • • • • 


T • • • • • • • • • • • • • • • • • 


TEXAS -If 
INSTRUMENTS 


SN74ACT8867 
32·81T VECTOR PROCESSOR UNIT 


RFCLK 
A5-AO 
B5·BO 
C5·CO 
05·00 
E5·EO 
F5-FO 


ENA 
ENBPA 
ENDAT 
ENM 
ENP 
ENS 
OES 
OEY 


<I> 


SN74ACT8867 
VECTOR PROCESSOR 
FLOATING·POINT 
SORT 


SYSTEM CLOCK 
FLOATING-POINT 
DIV <::::L1 


BINARY POINT ADJ. 


FIXED·TO-FLOAT 
I CONVERSION 
BLOCK 
ROUND MODE SELECT 


CONTROL 
CARRY IN 


RFCLOCK 
TEST PINS 
6 
RF ADDRESS 
A I 
6 
RF ADDRESS 
B 
WRITE 
BINARY POINT 


6 
ADJ. REGISTER 
RFADDRESSC 
6 
RFADDRESS 
0 
INPUT 


6 
RFADDRESS 
E 
READ 
REGISTERS 


6 
RFADDRESS 
F 
INSTRUCTION 
REGISTER 
REGISTERS 


"- :~: I WRITE ENABLE 


FLOW 
MULT& 
ALU 


"- 


THRU 
REGISTERS 


PROD&SUM 


0 
REGISTERS 


1 
STATUS 


2 
ALU 
REGISTER 


3 
4 
ALU MUX 
INSTRUCTION 
MULTMUX 


RFMUXA 
I 
DA 
<::::L1 
5 
PRO 


6 
SELECT 
RFMUXB 
I 
DB 
d...1 
2 
7 
MULT 
SUM 


8 
STATUS SOURCE 
.: 


"- 


YMUX 
I 
ALU 
d-1 
ALU INSTRUCTION 
& INPUT REGs 
MULT 


"- 
BINARY POINT ADJUST 
REGISTER 


"- 
DA, DB DATA REGISTERS 
ERROR 


"- 
MULT INSTRUCTION 
& INPUT REGs 
ENABLE 
CARRY OUT/UNDERFLOW 


"- 
PRODUCT 
REGISTER 
SIGN 


"- 
SUM REGISTER 
STATUS 
ZERO 


"- 
STATUS OUTPUT 
OVERFLOW 


"- 
Y31 - YO, PY OUTPUTS 
DIV-BY~/UNK 
F/F 
..., 
r 


0 


~ 
~ 


0 
• 
• 
• 
• 
• 
• 
31 
31 


0 


~ 


PARITY 
••• 
31 


FFEN 
FMAT 


DAO••• 
DA31 


DBO••• 
DB31 


BPA4·PBAO 


RND 
CIN 


SELA1·SELAO 
SELM3-SELMO 


SELRFA 


SELST1·SELSTO 


SELY 


ERROR 
COUT/U 
N 
Z 
OVER 
STEX 
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0A3 
·OAO 


32 


OBJ ·OBO 


32 


RFClK 
---...- 


WEA---...- 


WEB---"'- 


~ENOAT 


~FMAT 
~FFEN 


~FTOAT 


~ENA 


~ENM 
~ENP 


~ENS 
~ENBPA 


SElM3 
SElM2 
2 


SElMl 
SElMO 
2 


A5.AO~ 


B5·BO ~ 


C5·CO ~ 


OS-DO 
~ 


E5-EO~ 


F5.FO~ 


--+- 
ClK 


-----+- 
FTM/A 


-----+- 
FTI 


-----+- 
FTP/S 


-----+- 
FTBPA 


-----+- 
RNO 


~ 
TP1·TPO 


~VCC 


~GNO 


3 
SElSTl 
SElSTO 
FTST 


ERROR, COUTIZ. 
N, Z, OVER, STEX 
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PIN 
PIN 
PIN 
PIN 
PIN 
PIN 


NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 
NO. 
NAME 


Al 
Y25 
C2 
RND 
E3 
6PA2 
J15 
065 
Pl 
DA13 
S2 
DA22 


A2 
Y24 
C3 
6PA3 
E4 
6PAO 
J16 
062 
P2 
DA27 
S3 
A3 


A3 
Y23 
C4 
SELY 
E14 
OEY 
J17 
066 
P3 
DA28 
S4 
DA31 


A4 
Y21 
C5 
Y31 
E15 
ENS 
Kl 
DA5 
P4 
DA29 
S5 
A4 


AS 
Y19 
C6 
Y29 
E16 
FTPS 
K2 
DA10 
P5 
FFEN 
S6 
64 


A6 
Y18 
C7 
Y22 
E17 
SELAl 
K3 
DA8 
P6 
A2 
S7 
60 


A7 
Y17 
C8 
VCC 
Fl 
SELM2 
K4 
GND 
P7 
VCC 
S8 
CO 


A8 
Y16 
C9 
VCC 
F2 
MI6 
K14 
VCC 
P8 
C4 
S9 
SELRFA 


A9 
Y13 
Cl0 
VCC 
F3 
MI8 
K15 
069 
P9 
SELRF6 
S10 
04 


Al0 
Yll 
Cll 
VCC 
F4 
6PA4 
K16 
067 
Pl0 
DO 
Sll 
E4 


All 
Yl0 
C12 
Y3 
F14 
FTST 
K17 
068 
Pll 
EO 
S12 
F4 


A12 
Y9 
C13 
COUT/U 
F15 
OVER 
Ll 
DA7 
P12 
GND 
S13 
FO 


A13 
Y8 
C14 
CIN 
F16 
AI4 
L2 
DA12 
P13 
FTDAT 
S14 
0631 


A14 
Y6 
C15 
YO 
F17 
FTMA 
L3 
DA14 
P14 
0629 
S15 
F3 


A15 
Y4 
C16 
N 
Gl 
SELMO 
L4 
OA17 
P15 
0628 
S16 
Fl 


A16 
Y2 
C17 
All 
G2 
SELM3 
L14 
0615 
P16 
0627 
S17 
0619 


A17 
Z 
01 
MIS 
G3 
ENP 
L15 
0613 
P17 
0616 
Tl 
0A20 


61 
TPl 
02 
6PAl 
G4 
VCC 
L16 
0611 
Rl 
DA15 
T2 
Al 


62 
Y28 
03 
EN6PA 
G14 
GND 
L17 
0610 
R2 
OA18 
T3 
65 


63 
Y26 
04 
FT6PA 
G15 
AI2 
Ml 
OA9 
R3 
AS 
T4 
63 


64 
PY 
05 
GNO 
G16 
AIO 
M2 
OA19 
R4 
OA30 
T5 
61 


65 
Y30 
06 
GNO 
G17 
061 
M3 
DA21 
R5 
FMAT 
T6 
C5 


66 
Y27 
07 
GNO 
Hl 
OAO 
M4 
0A23 
R6 
AO 
T7 
C3 


67 
Y20 
08 
GND 
H2 
OAl 
M14 
0622 
R7 
62 
T8 
Cl 


68 
Y15 
09 
GNO 
H3 
DIV/SQRT 
M15 
0620 
R8 
C2 
T9 
WEA 


69 
Y14 
010 
GNO 
H4 
SELMl 
M16 
0618 
R9 
WE6 
Tl0 
RFCLK 


610 
Y12 
011 
GNO 
H14 
SELAO 
M17 
0612 
Rl0 
02 
Tll 
05 


611 
Y7 
012 
GNO 
H15 
060 
Nl 
DAll 
Rl1 
E2 
T12 
03 


612 
Y5 
013 
GNO 
H16 
CLK 
N2 
DA24 
R12 
F2 
T13 
01 


613 
ERROR 
014 
DES 
H17 
063 
N3 
0A25 
R13 
ENOAT 
T14 
E5 


614 
STEX 
015 
SELSTO 
Jl 
DA2 
N4 
0A26 
R14 
0630 
T15 
E3 


615 
Yl 
016 
SELSTl 
J2 
DA6 
N14 
0626 
R15 
0623 
T16 
El 


616 
FTl 
017 
ENA 
J3 
OA3 
N15 
0625 
R16 
0621 
T17 
F5 


617 
AI3 
El 
ENM 
J4 
OA4 
N16 
0624 
R17 
0617 


Cl 
MI7 
E2 
TPO 
J14 
064 
N17 
0614 
Sl 
OA16 
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PIN 


NAME 
NO. 
I/O 
DESCRIPTION 


AO 
R6 


A1 
T2 
Register File Write Address A. These bits select destination 
registers for feedback data from the 
A2 
P6 
I 
multiplier or input data from the DA bus. Values may be stored in locations 0 to 45. Also, these 
A3 
S3 
bits are used with the WEA pin to enable writing. Writing to a nonexistent 
location performs no 
A4 
S5 
useful operation, generates no status error, and does not affect other data in the register file. 
A5 
R3 


AIO 
G16 


AI1 
C17 
AlU 
Instruction. 
The AlU 
instruction bits are input through the instruction 
register in flow-through 
or 
AI2 
G15 
I 
AI3 
B17 
registered mode. 


AI4 
F16 


BO 
S7 
Register File Write Address B. These bits select destination 
registers for feedback data from the 
B1 
T5 
B2 
R7 
multiplier or input data from the DB bus. Values may be stored in locations 0 to 45. Also, these 


B3 
T4 
I 
bits are used with the WEB pin to enable writing. Data on the B port is ignored if the B write 


B4 
S6 
address matches the A write address. Writing to a nonexistent 
location performs no useful oper- 


B5 
T3 
ation, generates no status error, and does not affect other data in the register file. 


BPAO 
E4 
BPA1 
D2 
Binary Point Adjust. 
Thebinary 
point adjust pins set the integer size of the fixed-point format used for 


BPA2 
E3 
I 
output float-to-fixed 
conversions. 
These bits are input through the BPA register in flow-through 
or 
BPA3 
C3 
registered mode. 


BPA4 
F4 


CO 
S8 


C1 
T8 
C2 
R8 
Register File Read Address C. These bits select operands for the A side of the multiplier. The read 
C3 
T7 
I 
operation is asynchronous 
with respect to all other register file controls. 
C4 
P8 
C5 
T6 


CIN 
C14 
I 


Carry-In Bit Control Line. Carry-in data bit or control line is used in AlU 
operations. 
This bit is input 


through the AlU 
instruction register in flow-through 
or registered mode. 


ClK 
H16 
I 


Clock. Main system clock for all registers except those associated with the register file and conversion 
blocks. When enabled, internal registers are latched on the rising edge. 


Carry-OuWnderflow 
Status. In unsigned operations, this pin is the carry-out bit and is updated to indicate 


COUT/U 
C13 
0 
overflow conditions. 
When the result of a floating-point 
operation 
is too small to be represented 
in 
normalized format, this bit is asserted and the result is forced to zero. It is also used in combination 
with 
the Z, N, and OVER status bits for compare operations. Additional 
information under OES. 


DO 
P10 


D1 
T13 
D2 
R10 
Register File Read Address D. These bits select operands for the B side of the mUltiplier. The read 


D3 
T12 
I 
operation is asynchronous 
with respect to all other register file controls. 


D4 
S10 
D5 
T11 
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PIN 


NAME 
NO. 
I/O 
DESCRIPTION 


DAO 
H1 
DA1 
H2 
0A2 
J1 
DA3 
J3 
OM 
J4 
DA5 
K1 
DA6 
J2 
DA7 
L1 
DA8 
K3 
DA9 
M1 
DA10 
K2 
DA11 
Nl 
DA12 
L2 
DA13 
P1 
DA14 
L3 
DA15 
Rl 
Data Bus A Side. 
Input 32-bit data bus to the A side of the register file. The data format may be 
DA16 
Sl 
I 
converted or passed by the fixed-to-float 
block. The format is selected by the FFEN and FMAT pins. 


DA17 
L4 
DA18 
R2 
DA19 
M2 
DA20 
Tl 
DA21 
M3 
DA22 
S2 
0A23 
M4 
DA24 
N2 
DA25 
N3 
DA26 
N4 
0A27 
P2 
DA28 
P3 
0A29 
P4 
DA30 
R4 
DA31 
S4 


DBO 
H15 
DBl 
G17 
DB2 
J16 
DB3 
H17 
DB4 
J14 
DB5 
J15 
DB6 
J17 
DB7 
K16 
DB8 
K17 
DB9 
K15 
DB10 
L17 
DBll 
L16 
DB12 
M17 
DB13 
L15 
DB14 
N17 
DB15 
L14 
Data Bus B Side. 
Input 32-bit data bus to the B side of the register file. The data format may be 
DB16 
P17 
I 
converted or passed by the fixed-to-float 
block. The format is selected by the FFEN and FMAT pins. 


DB17 
R17 
DB18 
M16 
DB19 
S17 
DB20 
M15 
DB2t 
R16 
DB22 
M14 
DB23 
R15 
DB24 
N16 
DB25 
N15 
DB26 
N14 
DB27 
P16 
DB28 
P15 
DB29 
P14 
DB30 
R14 
DB31 
S14 
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PIN 


NAME 
NO. 
VO 
DESCRIPTION 


DIV/SQRT 
H3 
I 
Divide/Square 
Root. The divide table from the divide/square 
root ROM is selected when pin is low and 
square root table is selected when pin is high. 


EO 
Pll 
El 
T16 
E2 
Rll 
Register File Read Address E. These bits select operands for the A side of the AlU. 
The read 
E3 
TIS 
I 
operation is asynchronous 
with respect to all other register file controls. 


E4 
SII 
Es 
T14 


Enable Control for AlU 
Input Registers and AlU 
Instruction Register, active low. Data will be latched 
ENA 
017 
I 
on the rising edge of ClK when this signal is low. 


ENBPA 
Enable Control for Binary Point Adjust Register, active low. Data will be latched on the rising edge of ClK 
03 
I 
when this signal is low. 


Enable Control for the Input Registers and Multiplier Instruction Register, active low. Data will be latched 
ENDAT 
R13 
I 
on the rising edge of ClK when this signal is low. 


ENIM 


Enable Control for Multiplier Input Registers and Multiplier Instruction Register, active low. Data will be 
El 
I 
latched on the rising edge of ClK 
when this signal is low. 


ENP 
Enable Control for the Product Register, active low. Output from the multiplier will be latched on the rising 
G3 
I 
edge of ClK 
when this signal is low. 


ENS 


Enable Control forthe 
Sum Register, active low. Output from the AlU 
will be latched on the rising edge 
Els 
I 
of ClK when this signal is low. 


Chip Error Status. This bit is updated every clock cycle, and is set when an overflow or underflow has 
occurred in any of the three status sources, a divide by zero is attempted in the multiplier, or a reserved 


ERROR 
B13 
0 
floating-point operand is inputto the chip. The ERROR signal works only in conjunction with floating-point 
and 2s complement 
operations 
in the arithemetic 
blocks. This pin will not be set by the AlU 
during a 
floating-point 
compare operation and should be carefully interpreted for multiple precision operations. 


Additional 
information under OES. 


FO 
S13 
Fl 
S16 
F2 
R12 
Register File Read Address F. These bits select operands for the B side of the AlU. The read operation 
F3 
Sls 
I 
is asynchronous 
w~h respect to all other register file controls. 
F4 
S12 
Fs 
T17 


Fix-to-Float Enable. This signal controls the fixed-to-float 
conversion block. When FFEN is high, integer 
orfractional 
values on the DA and DB data input buses are converted to the internal floating-point 
format. 


FFEN 
PS 
I 
When FFEN is low, integer, fractional, M string, and floating-point 
values are input without conversion. 


The state of this control is latched along with the input data values. Works in conjunction with FMAT (See 
Table 13). 


FormatType. 
This signal controls the format type of the data values on input buses DA and DB. The state 
FMAT 
Rs 
I 
ofthis control is latched along with the input data values. Works in conjunction with FFEN (See Table 13). 


FTBPA 
04 
I 
Flow-Through 
Control for the Binary Point Adjust Register. When high, the BPA register is bypassed. 


Flow-Through Control for the Input Registers on the Data Input Buses DA and DB. When high, the input 
FTDAT 
P13 
I 
registers are bypassed. 


Flow-Through 
Control for the Multiplier 
and AlU 
Instruction 
Registers. 
When high, the instruction 
FTI 
B16 
I 
registers are bypassed. 


Flow-Through 
control for the MUlTA, MUlTB, 
ALUA, and AlUB 
Registers. When high, these registers 
FTMA 
F17 
I 
will be in flow-through 
mode. 


Flow-Through 
Control for Product and Sum Output Register. When high, the product and sum registers 
FTPS 
E16 
I 
are bypassed. 


FTST 
F14 
Flow-Through 
Control for Status Register. When high, the status register is bypassed. 
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PIN 


NAME 
NO. 
I/O 
DESCRIPTION 


P12 
G14 
D13 
D11 
D10 
GND 
D9 
I 
Ground. 
All ground pins must be used and connected. 
D8 
D7 
D6 
D5 
K4 


MI5 
D1 
MI6 
F2 
Multiplier Instruction. 
The multiplier instruction bits are input through the instruction register in flow- 
MI7 
C1 
I 
through or registered mode. 
MI8 
F3 


Negative Sign. The N bit is a duplicate of the sign of the result from the multiplier, ALU, or float-to-fixed 


N 
C16 
0 
block. For overflow, N indicates the direction of overllow. Also, N is used in integer and floating-point 
compare operations. Additional information 
under OES and OVER. 


OES 
D14 
I 
Enable Control for the Status Output Drivers, active low. When low, the status pins will output normally. 
When high, they will be in an high-impedance 
state. 


Enable Control for the Y Output Drivers, active low. When low, the Y outpu1 pins will output normally. 
OEY 
E14 
J 
When 
high, 
the 
Y outputs 
will 
be in high-impedance 
state. 
OEY 
also 
sets 
the 
PY output 
to 
high-impedance 
state. 


Overllow. OVER is set high when an overflow condition occurs in the multiplier, ALU, or float-to-fixed 


OVER 
F15 
0 
block. The sign status bit indicates the direction of overflow, with N low for positive overllow and N high 
for negative overflow. The output data value from operations that overflow is undetermined. 
Additional 
information under OES and SELST1-SELSTO. 


Even Parity. Even parity is generated 
from the final result of the float-to-fixed 
block and presented 
PY 
B4 
0 
ex1ernally at the rising edge of the clock. Additional information 
under OEY. 


RFCLK 
T10 
J 
Register File Clock. Data is latched for writing onthe rising edge of the clock. Additional information under 
ENDAT, WEA, and WEB. 


Round Mode Select. When low, no rounding is perlormed in the float-to-fixed 
block:When 
high, the result 


RND 
C2 
I 
of a floating-point 
output or conversion is rounded. RND is input through the BPA register in flow-through 
or registered mode. 


SELAO 
H14 
SELA1 
E17 
I 
Select ALU Multiplexer. 
Control for the ALU input multiplexers. 


SELMO 
G1 
SELM1 
H4 
SELM2 
F1 
I 
Select Multiplier Multiplexer. 
Control for the multiplier input multiplexers. 


SELM3 
G2 


SELRFA 
S9 
J 
Select Register File A. Multiplexer control from write data to the A side of the register file. 


SELRFB 
P9 
I 
Select Register File B. Multiplexer control from write data to the B side of the register file. 


SELSTO 
D15 
Select Status Source. Status from the multiplier, ALU, orfloat-to-fixed 
block may be selected for display 
SELST1 
D16 
on the status output pins by SELST1-SELSTO. 


SELY 
C4 
I 
Select Y Multiplexer. This control selects input to the output float-to-fixed 
block. 


State of ALU/Divide by Zero. This bit indicates divide by zero from the multiplier and the state of the link 
STEX 
B14 
0 
!l!E:!!op in the ALU depending 
on the state of the SELST1-SELSTO 
pins. Additional 
information 
under 
OES. 


TPO 
E2 
TP1 
B1 
I 
Test Pin. A complete explanation 
is given in Table 25. 
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NAME 
NO. 
I/O 
DESCRIPTION 


K14 
C11 
C10 


VCC 
C9 
I 
Supply voltage (5 V) 


C8 
G4 
P7 


WEA 


Write Enable for Register File A Input. active low. When low. a write operation may be performed 
by 
T9 
I 
asserting data and write address and toggling RFCLK. 


WEB 
T 


Write Enable for Register File B Input. active low. When low. a write operation 
may be performed 
by 


R9 
asserting data and write address and toggling RFCLK. 


YO 
C15 
Y1 
B15 


Y2 
A16 


Y3 
C12 
Y4 
A15 
Y5 
B12 


Y6 
A14 
Y7 
B11 
Y8 
A13 


Y9 
A12 
Y10 
A11 
Y11 
A10 
Y12 
B10 
Y13 
A9 
Y14 
B9 
Y15 
B8 
Output 32-Bit 
Data Bus. Output data may be converted 
from a floating-point 
to fixed-point 
in the 
Y16 
AS 
OIZ 
float-to-fixed 
block. These outputs may be set to high-impedance 
state by the OEY pin. 
Y17 
A7 
Y18 
A6 
Y19 
A5 


Y20 
B7 
Y21 
A4 
Y22 
C7 
Y23 
A3 


Y24 
A2 
Y25 
A1 
Y26 
B3 


Y27 
B6 


Y28 
B2 


Y29 
C6 
Y30 
B5 
Y31 
C5 


Z 
A17 
0 
Zero Status. This bit is set when the result of an operation is zero. Additional 
information 
under OES. 


register 
file architecture 


The register 
file of the 'ACT8867 
VPU is a master-slave 
memory 
with two write ports and four read ports. The 


register 
file has 46 locations, 
each 40 bits wide. Each of the write and read ports are addressed 
by a separate 


6-bit address 
bus. Location 
0 is addressed 
by a zero on the address 
pins and subsequent 
locations 
1 to 45 are 


addressed 
in normal binary fashion. 
The write address 
pins are labeled A5-AO and 85-80 
and control 
data from 


the RF MUX A and RF MUX 8 data buses, respectively. 
The read addresses 
are labeled 
C5-CO, 05-00, 
E5-EO, 


and F5-FO and select data for the register file outputs going to the multiplier 
and ALU multiplexers. 
Read address 


pins C5-CO and 05-00 
select 
operands 
for the multiplier, 
whereas 
read address 
pins E5-EO and F5-FO select 
operands 
for the ALU. Feedback 
to the register 
file from the multiplier 
and ALU is multiplexed 
into the A and 8 


write inputs, 
respectively. 
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Generally, data may be written into or read from any register file location by any input or output simultaneously. 
However, in the case of a simultaneous write to the same location, the data from RF MUX A is actually stored 
in the location, while the data from RF MUX B is ignored. Other than this, all read and write cycles may occur 
together in a single clock cycle. Addressing nonexistent locations is harmless, but data read from a nonexistent 
location will be undefined. No error status is generated from any of these conditions. The write cycle is an 
edge-triggered master-slave cycle with data latched on the rising edge of the register file clock (RFCLK). 


WRITE ENABLE 
REGISTER 
FILE 
CONTROL 
ADDRESS 
LOCATION 
INPUT SOURCE 


WEAlow 
AS-AO 
RF MUXA 


WEB low 
BS-BO 
RF MUX B 


REGISTER 
FILE 
ADDRESS 
LOCATION 
OUTPUT DESTINATION 


CS-CO 
MULTMUXA 


DS-DO 
MULTMUXB 


E5-EO 
ALU MUXA 


F5-FO 
ALU MUX B 


data Inputs 


Two sources, the fixed-to-float conversion block and the multiplier output, supply inputs to the RF MUX A. Input 
selection to RF MUX A is controlled externally by the SELRFA select line.Simiiarly, RF MUX B inputs come from 
the ALU output or the fixed-to-float conversion block. Input selection to RF MUX B is controlled externally by 
the SELRFB select line. 


Data from either the DA or DB input bus is converted into the internal 40-bit data format of the VPU by the 
fixed-to-float block. Each 32-bit external input is converted into an internal format suitable to the operand type, 
either a floating-point number or an integer. 


The fixed-to-float format conversion is selected by the FMAT control pin. The same format control is used for 
both inputs so only similar operand types may be input simultaneously. If FMAT is low, the input is packed as 
a floating-point number. If FMAT is high, it is packed as an integer for the purpose of the packing operation, 
signed fractional, unsigned fractional, unsigned integers, and normal integers are eqUivalent. Signed integers 
or signed fractional numbers may optionally be converted to floating-point numbers in this block, as described 
infixed-to-float block section. The following examples show the data format ofthe signed and unsigned fractional 
operands: 


Signed Fraction, 32 bit 2s complement. 
I 31 130 
01 
sign <-MSB---------------,LSB-> 
Note: magnitude of bit 30 is 2E-1. 
Example: +0.125 = H'10000000' 


- 0.125 = H'FOOOOOOO' 


Unsigned Fraction, 32 bit. 
131 
01 
<-MSB---------------LSB-> 
Note: magnitude of bit 31 is 2E-0. 
Example: +0.125 = H'10000000' 
+1.125 = H'90000000' 
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RFMUXA 
RF MUX B 


SELRFA 
INPUT SOURCE 
SELRFB 
INPUT SOURCE 


L 
producl 
L 
sum 


H 
fixed-Io-float 
block A 
H 
fixed-Io-floal 
block B 


address 
inputs 


Each data port of the register file has a separate 
address 
control bus. Each address 
bus is six-bits wide, allowing 
a total of 64 locations 
to be addressed. 
However 
in the register file, there are only 48 usable locations 
(locations 
o through 
47 decimal). 
The 16 addresses 
to nonexistent 
locations 
may be input without 
harmful 
effects, 
except 
that undefined 
data is output for a read operation 
to these 
invalid 
addresses. 


The address 
buses 
A5-AO and B5-BO control 
the addresses 
for the two RF write 
inputs 
(see Table 
1). The 
address 
inputs work in conjunation 
with the write enable 
lines (WEA and WEB) 
and register 
file clock RFCLK 
to perform 
a write operation 
(see Table 2). Read address 
buses C5-CO and D5-DO select 
input data for the A 
MUX and B MUX of the multiplier, 
respectively. 
Read address 
buses E5-EO and F5-FO select 
input data for the 
A MUX and B MUX of the ALU, respectively. 


The RF read addresses 
select locations 
asynchronously 
to the write addresses, 
write enables, 
register file clock, 
and each 
other. 
The data stored 
at the addressed 
location 
will be available 
at the outputs 
after the output 
propagation 
delay, irrespective 
of all other control 
lines to the register file. Since a write changes 
the data stored 


in a location, 
a concurrent 
read from this location 
will depend 
on the write delays 
before the updated 
data will 
be available 
at the outputs. 
The updated 
data will be available 
after the rising edge of the RFCLK and the normal 
read propagation 
delay. A simultaneous 
read during 
a write cycle will produce 
the old value and a read in the 
subsequent 
cycle will produce 
the new value. 


Locations 
46 and 47 decimal 
of the register 
file are special 
purpose 
addresses 
used to implement 
the full 
flow-through 
mode of operation 
forthe 
register file. Full flow-through 
mode is activated 
by setting the write enable 
WEA high on the A write port orWEB 
on the B write port and addressing 
location 
46 decimal for A port, or location 


47 decimal 
for B port, respectively. 
No memory 
is associated 
with these locations. 
They continually 
reflect the 
data being input on the port A and port B inputs to the register 
file. 


Addressing 
location 
46 decimal 
for any read will output the current 
input on port A to the selected 
read output. 


Addressing 
location 
47 decimal 
for any read will output the current 
input on port B to the selected 
read output. 


Addressing 
these locations 
at any read port while holding 
WEA or WEB low will result in a partial flow-through 
along with an active write mode. 
In the partial flow-through 
mode, the read port reflects the condition 
of output 
of the master write latches. These master latches are transparent 
when RFCLK is low, and they capture the value 
of port A or port B on the positive 
edge of RFCLK. As such, the flow-through 
operation 
is disabled 
when RFCLK 
is high. 


edge-triggered 
write 
mode 


The write operation 
of the register 
file is implemented 
as an edge-triggered 
master-slave 
cycle where the state 
of the data inputs, 
write addresses, 
and write enables 
are latched 
on the rising edge of the register 
file clock 
(RFCLK). 
The new data is available 
to be read from the write location 
irrespective 
of the falling edge of the clock 
after only a propagation 
delay time from the rising edge. Enables for the write operations 
of input port A and port 
B are the signals 
WEA and WEB, 
respectively. 
These 
signals 
are active 
low and must be at a low 
logic level 
to perform 
a write 
operation. 
When 
the enables 
are at a high logic level, the associated 
inputs 
are in a full 
flow-through 
mode. 
No data is written 
to the register 
file while it is in full flow-through 
mode. 
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A write may be performed 
to any of the register file locations 
0 through 
45 decimal 
from input port A or input port 


B separately 
or concurrently. 
However, 
if a concurrent 
write is attempted 
to the same location, 
the data from input 


port A will be written 
and the data from input port B will be ignored. 
No status error is generated 
when data from 


the B port is lost during a concurrent 
write from the A and B ports to the same location. 
Addressing 
to locations 


48 through 
63 decimal 
performs 
no useful function. 
To perform 
a write operation: 


1. 
Set up data and address, 
and bring write enable(s) 
low. 


2. 
Toggle 
RFCLK 
(low-to-high 
transition). 


3. 
Return 
write enable(s) 
to high. 


If the data input register 
in the fixed-to-float 
block is enabled, 
then data from the input pins is latched first in this 


register; 
the data may be written 
to the register 
file on a subsequent 
cycle. 


flow-through 
mode 


The full flow-through 
mode of the register 
file is implemented 
by the use of two special 
purpose 
register 
file 


locations 
in which the storage 
latch directly connects 
the data lines to the register file. Therefore, 
they will always 


produce 
the current 
values 
of these 
lines when they are addressed 
by a read operation. 
Location 
46 decimal 


is connected 
to the port A input and location 
47 decimal 
is connected 
to the port B input. As a result, either port 


A or port B data is accessible 
to any output 
of the register 
file. 


The input values 
for these locations 
depend 
on the state of the write enables. 
When a write enable 
is high, the 


input master 
latch for the data input is bypassed 
and any data applied 
to the port A or port B input is directly 


reflected 
in the appropriate 
flow-through 
location 
(see Table 4). If the write enable 
is low, then the value on the 


data lines is the data stored 
in the input master 
latch. For example, 
if WEA is high and WEB is low then register 


file location 
46 will be the same as the data being applied 
to input DA (assuming 
FTDAT and SELRFA 
are high) 


and location 
47 decimal 
will be the data on the input master 
latch of input port B. When the RFCLK 
is high the 


values of WE A and WEB have been latched at the rising edge and will be maintained 
at those values until RFCLK 


is low again. 


ADDRESS 
LOCATION = 46 DECIMAL 
ADDRESS 
LOCATION = 47 DECIMAL 


WEA 
RFCLK 
DATA SOURCE 
WEB 
RFCLK 
DATA SOURCE 


H 
X 
Port A (full flow-through) 
H 
X 
Port B (full flow-through) 


L 
L 
Port A Input Latch (transparent) 
L 
L 
Port B Input Latch (transparent) 


L 
H 
Port A Input Latch t 
L 
H 
Port B Input Latch t 


double-pumped 
mode 


The register 
file is fast enough 
to allow operation 
in a double-pumped 
mode 
(see Figure 
1).For each system 


cycle, two operands 
per input bus may be written 
to the register 
file by clocking 
the register 
file clock (RFCLK) 


at twice the speed of the main system 
clock. This feature 
is very useful in conjunction 
with the input fixed-to-float 
register. 
It allows a feedback 
write to the register 
file and a data input operation 
to be completed 
through 
each 


write port every cycle of the main system 
clock. 
Feedback 
data is written 
at the rising edge of the main clock 


and input data is latched 
in the fixed-to-float 
register 
at the same time. The input data is then written 
on the half 


cycle without 
affecting 
feedback 
operations. 
Using the RFCLK 
at twice the frequency 
of the main system 
clock 


also allows 
large amounts 
of data to be loaded very rapidly. 
However 
the speed 
of the fixed-to-float 
block, will 


not allow this double-pumping 
operation 
if the input data must be converted 
from integer to floating-point 
status. 
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ClK 
---A 
\ 
X 
I 
I 
I 
RFClK 
----1 
" 
1 
\ 
/, 
'-- 
I 


I \ 


I 
I 
I 
\ 
FTDAT 
I 
I 
I 
I 
I 
I 


ENDAT 
I I 
I 
\ 
I 
I 
I 
I 
I 


DA,DB 
~ 
: 
x 
DtTA 
X 


WEA 
I I 
\ 
I 
I 
\ 
I 
I 
I 
I 
I 


I 
I 
I 
WEB 
I 
\ 
I 
I 
I 
I 
I 
I 


SElRFA 
I I 
I 
\ 
I 
I 
I 
I 


AS·AO 
~F02 
X 
X 
FFOO 
X 
X 
FF02 
X 


BS·BO 
: 
X 
: RF01 
X 
: 


MIS-MIS 
~ 
: 
X 
INSTR: 
X 


MUlT 
I 
X 
I 
c= 
FEEDBACK 
PRODUCT 
X 
PRODUCT 


NOTE: 
The above diagram shows the general relationship of control signals and timing for double-pumping 
the register file. Input data on DA or 
DB buses are latched in the data registers while feedback data from the multiplier is written into register file location RF02 at the rising 
edge of ClK. 
The latched input data is then written into register file location RFOO and RF01 at the next rising edge of RFClK. 


data formats 


Externaldataformatsthat can be inputon the DAand DBpinsareas follows: 


1. 
Single Precision "oating-point, DEe F-Format 
131 
16115 114 
716 
01 


<---low 
mantissa---> 
sign<~xponent-> <-high-> 
mantissa 


Example: +27.125= H'000042D9'- 27.125= H'0000C2D9' 


2. 
Signed Integer, 32 bit 2s complement 
131 I 30 
01 


sign<-MSB 
LSB-> 


Example: +27= H'0000001B' - 27= H'FFFFFFE5' 


3. 
Unsigned Integer (or bit string), 32 bit 
131 
01 
<-MSB~----------------LSB-> 
Example: +27= H'0000001B' 


4. 
Signed Fraction, 32 bit 2s complement 
I 31 130 
01 


sign<-MSB---------------,LSB-> 
Note: magnitudeof bit 30 is 2E-1. 
Example: +0.125= H'10000000'- 0.125= H'FOOOOOOO' 
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5. 
Unsigned Fraction, 32 bit 
131 
01 


<-MSS----------------·LSB-> 
Note: magnitude of bit 31 is 2E-0. 
Example: +0.125 = H'10000000' 
+1.125 = H'90000000' 


Internal data formats as used by the multiplier and ALU are as follows: 


1. 
Single-Precision floating-point 
139 
32131 130 
01 


<-exponent-> 
sign <------·mantissa·------> 
17<---->01 
extended 
percision bits 


Note: The 2E-1 bit of the mantissa is not stored. Extended precision bits are zeroed on initial data input. 
Example: +27.125 = H'8559000000' - 27.125 = H'85D9000000' 


2. 
Signed Integer, 32 bit 2s complement 
139 
321 31 130 
01 


<-unused-->sign<-MSB·----------LSB--> 
Example: +27 = H'xx0000001 B' - 27 = H'xxFFFFFFE5' 


3. 
Unsigned Integer (or bit string), 32 bit 
139 
32131 
01 


<-unused-><-MSB~-------------LSB-> 
Example: +27 = H'xx0000001 S' 


4. 
Signed Fraction, 32 bit 2s complement 
139 
32131 130 
01 


<-unused->sign<-MSB------------·LSB-> 
Note: magnitude of bit 30 is 2E-1. 
Example: +0.125 = H'xx10000000' - 0.125 = H'xxFOOOOOOO' 


5. 
Unsigned Fraction, 32 bit 
139 
32131 
01 


<-unused-><-MSB-------------LSB-> 
Note: magnitude of bit 31 is 2E-0. 
Example: +0.125 = H'xx10000000' 
+1.125 = H'xx90000000' 


Data feedback from the multiplier or ALU will be stored in one of these formats. The output float-to-fixed block 
allows output of all of the input formats listed above plus fixed-point operands with the radix point aligned to any 
even bit boundary. 


In general, the terms floating-point number will be used to refer to data type 1 above. Integer numbers refers 
to both type 2 and 3 formats. Type 4 and 5 formats are fractional numbers. Integer and fractional numbers are 
collectively referred to as fixed~point numbers implying a fixed radix point position. Logical operands will usually 
conform to the integer format. The notation F, I, and J will occasionally be used in this data sheet to represent 
floating-point, integer, and fractional (or fixed-point) formats, respectively. 
. 


External floating-point inputs with a zero exponent and the sign bit set are considered to be reserved operands. 
These reserved operands are detected and flagged accordingly inthe fixed-to-float conversion blocks. Internally, 
floating-point numbers with a zero exponent are treated as floating-point numbers with a zero exponent and a 
zero mantissa. Internal floating-point numbers with a zero in the exponent, when inputto the float-to-fixed block, 
are converted to a floating-point number with a zero in the exponent and a zero mantissa. 
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multiplier 
architecture 
and operation 


The multiplier 
of the VPU is designed 
to perform 
normal 
multiplication 
on floating-point 
and integer 
operands 
and several 
other special 
purpose 
operations 
for Newton-Raphson 
division, 
square 
root, and double-precision 
arithmetic. 
The data flow diagram 
for the multiplier 
is illustrated 
in Figure 
2. 


The multiplier 
has two data input ports of 40 bits and one data output bus of 40 bits. In the multiplier, 
intermediate 
results are truncated 
rather than rounded. 
Error introduced 
by this truncation 
is usually 
insignificant 
for integer 
and fractional 
operations 
and acceptable 
for floating-point 
operations 
because 
of the extended 
precision. 
A 
64-bit mantissa 
is retained 
internal to the multiplier 
until the last step where it is truncated 
to its final output form. 
The 
multiplier 
internally 
uses an extended 
floating-point 
format 
(40 bits) that is not DEC 
F-format 
and may 
therefore 
produce 
different 
(and possibly 
more accurate) 
answers 
for sequences 
of more than two operations 
done internally 
in floating-point 
format 
(I.e., different 
from the result from a nonextended 
format). 


The VPU multiplier 
consists 
of three main. blocks: 


1. 
MULTIPLIER 
CORE: 
processes 
the mantissa 
portion 
of floating-point 
data or 32-bit 
signed 
or unsigned 
fixed-point 
operands. 


2. 
EXPONENT 
ADDER/ADJUST: 
processes 
the biased exponent 
and adjusts 
it according 
to controls 
from the 
mantissa 
shift block, and 
then generates 
floating-point 
status. 


3. 
MANTISSA 
SHIFT: normalizes 
the mantissa 
and generates 
the fixed-point 
status. 


Block 1 above 
multiplies 
the low-order 
32 bits of both 40-bit data input ports, Block 2 handles 
the high-order 
8 
bits when 
a floating-point 
operation 
is performed. 
The result 
of block 3 is to place the product 
output 
in the 


prescribed 
internal 
floating-point 
format 
with 1/2:$ mantissa 
< 1. 


data inputs 


Data for the two operands 
of the multiplier 
are selected 
from the following 
sources 
and by the multiplexer 
select 
codes shown 
in Table 5. 


MULTAMUX 
MULTBMUX 


SELM1 
SELMO 
OPERAND 
SOURCE 
SELM3 
SELM2 
OPERAND SOURCE 


L 
L 
Multiplier output bus 
L 
L 
Multiplier output bus 


L 
H 
ALU output bus 
L 
H 
ALU output bus 


H 
L 
Seed ROM 
H 
L 
Output numeric "1"t 


H 
H 
Register file port C 
H 
H 
Register file port D 
t The number '1' output by the B MUX will be in the correct integer or floating-point 
format needed as determined 
by 
the current multiplier instruction, 


DIV/SQRT 
SEED ROM 


L 
Floating-Point 
Division 


H 
Floating-Point 
Square Root 


. 
.l1~ 
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A Input 
••• 


SELM3 
SELM2 


SELM1 
SELMO 


ALU Feedback 


MULT Feedback 


DESTINATIONS 
MULTIPLIER 
OUTPUT ROUTE DESCRIPTION 


Multiplier input A 
MULT A MUX and MULT A register 


Multiplier input B 
MULT B MUX and MULT B register 


ALU input A 
ALU A MUX and ALU A register 


Register file 
RF MUXAt 


Youtput 
Product register and Y MUX 


t Feed through of multiplier resu~ via the register file is not allowed since direct 
feedback is available. 
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Newton-Raphson division support 


The multiplier provides support for doing Newton-Raphson division using single-precision operands. Division 
is only supported for floating-point format due to size limitations of the seed ROM. Support consists of a seed 
(see Table 6) and the implementation of a special instruction to perform the necessary operations of a 
Newton-Raphson division cycle. This cycle performs the iterative equation: 


where B isthe divisor and X(i) is a close approximation ofthe reciprocal of B.An instruction is available to perform 
the (2-B.X) step in one cycle without use of the ALU. The details of the implementation of this instruction and 
its restrictions and double-precision 
Newton-Raphson 
division is also supported and are described in 


Application Information section. 


For floating-point operands one cycle involves two separate steps which proceeds as follows: 


Step 1: 
Instruction (2 - A • B) 
A = input from the seed ROM or _1/B 
B = divisor 


Step 2: 
Instruction A. B 
A = input from the seed ROM or _1/B 
B = result from step 1 


Step one is the special instruction (2-A.B) where A is the seed for the division cycle and B is the divisor. Step 
two isjust a normal floating-point multiplication operation. Status isgenerated inthe same manner aswith normal 
floating-point operations. Two cycles (four multiplies) will be required to generate a single-precision reciprocal 
(1/divisor) from a seed of approximately 8 bits of accuracy. The final multiply of 1/B. A (1/divisor. dividend) is 
also a normal floating-point multiplication operation. 


The seeds for single-precision division (and square root) are generated from an internal seed ROM feeding the 
left multiplier input (MULT A) multiplexer that ontrols to the ROM select whether it is a seed for division or square 
root. Seeds for division have approximately eight bits of accuracy and are approximations to the actual 
reciprocals. Seeds for square root also have approximately eight bits of accuracy and are approximations to 
square root reciprocals of the numbers. The seeds are generated for optimum convergence of all input values. 
Consequently, the seed for certain values that can be represented exactly in the seed length will not be the exact 
reciprocal value since the seed is chosen to be the best for all values within the range. If desired, an external 
seed ROM can also be used to feed data through the DA or DB inputs and register file to the multiplier. The most 
significant bits of the reciprocal are critical for the Newton-Raphson process and will be set as shown below: 


B = 1.................•...•..... 
1 
131 
01 
<-exponent-> 
sign <----mantissa----> 
V 
V 
Adder 
Seed 
V 
ROM 
V 
V 


-1/B = 1•.••.•...••.•..•..•.•.••••. 
1•..•.•. 131 
01 
<--exponent-> 
sign <----mantissa'-----> 
<-zeroed----> 
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Seeds for square 
roots will be generated 
in the same manner 
except 
a different 
table for the ROM will be used 
(see Table 6). The exponent 
of a floating-point 
reciprocal 
will be produced 
from the biased exponent 
of the input 
using the following 
equations: 


Division : exponent = 257 - exponent 


(386 - exponent) 
Square Root: exponent = 
2 


The biased 
exponent 
value will give legal division 
seed exponent 
values 
for the range 2 to 255. If the biased 
exponent 
is 0, division 
is an undefined 
operation 
and the multiplier 
will set the divide-by-zero 
status bit since the 
value on the B input will be a zero. If the biased 
exponent 
is 1, the reciprocal 
seed overflows 
and the exponent 
presented 
to the A input will be zero and the multiplier 
will set the divide-by-zero 
status bit. For square 
root, the 
allowable 
range 
for the biased 
exponent 
value 
is 1 to 255. An exponent 
of zero 
is an error 
because 
the 
Newton-Raphson 
square 
root finds the reciprocal 
of the square 
root. In this case the multiplier 
will also set the 
divide-by-zero 
status 
bit. 


Newton-Raphson square root support 


The multiplier 
provides 
support 
for doing 
Newton-Raphson 
square 
root; the result is actually 
the reciprocal 
of 
the square 
root for single-precision 
operands. 
Square 
root is only supported 
for floating-point 
format 
because 
of size limitations 
for the seed ROM. Support 
consists 
of a seed (see Table 6) and the implementation 
of a special 
instruction 
to perform 
the necessary 
operations 
of a Newton-Raphson 
square 
root cycle. The square 
root cycle 
performs 
the iterative 
equation: 


X(i+ 
1) :=X(/) 
x ~ x (3-8 
X X(/) 
x X(/)) 


where 
B is the number 
whose 
square 
root is.desired 
and X(i) is a close approximation 
of the reciprocal 
of the 
square 
root of the number. 
An instruction 
is available 
to perform 
the 1/2*(3-K*I) 
step in one cycle without 
use 
of the ALU (K and I are the two input operands). 
The other instructions 
necessary 
for the square 
root cycle are 
the same as those needed for Newton-Raphson 
division 
(so no other special instructions 
are needed specifically 
for square 
root). 
Operands 
are in signed 
floating-point 
format 
but square 
roots 
of negative 
values 
are not 
supported. 
However, 
the 
sign 
is ignored 
and 
the 
resulting 
sign 
for 
negative 
values 
is 
undetermined. 


Double-precision 
Newton-Raphson 
square 
root will not be specially 
supported. 


For floating-point 
operands 
one cycle involves 
three separate 
steps which 
proceeds 
as follows: 


Instruction 
(A*B) 
A", 
input from the seed ROM or -1/SQRT(B) 
B = number 
whose 
square 
root is to be found 


Instruction 
(3-A*B)/2 
A = input from the seed ROM or -1/SQRT(B) 
B = result from step 1 


Step 3: 
Instruction 
A*B 
A = input from the seed ROM or -1/SQRT(B) 
B = result from step 2 


Step 1 is a normal multiply instruction 
for operand 
B and its seed. Step 2 is the special instruction 
(3-A*B)/2 where 
A is the seed for the square 
root cycle 
and B is the result from 
step one. Step 3 is another 
normal 
multiply 


instruction 
for the result from step 2 and the seed. 
Status 
is generated 
in the same 
manner 
as with normal 
floating-point 
operations. 
A final multiply 
of 1/SQRT(B)*B 
to get the actual square 
root of a number, 
if needed, 
is also a normal 
floating-point 
multiplication 
operation. 
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Newton-Raphson division support under the multiplier section covers the description ofthe process using seeds 
(the process is the same as division). Two cycles, six multiplies, will be required to generate a single-precision 
reciprocal square root from a seed of approximately eight bits of accuracy. 


ALU architecture 
and operation 


The ALU of the VPU is designed to perform addition and subtraction on floating-point and integer operands, 
perform logic functions on bit strings, and to perform several other special operations for double-precision 
support. The functional diagram for the ALU is shown in Figure 3. 


The ALU has two data input ports of 40 bits and an data output bus of 40 bits. The ALU internally uses an 
extended floating-point format (40 bits) that is not DEC F-format and may therefore produce different (and 
possibly more accurate) answers for sequences of more than two floating-point operations done internally. 
The implementation of the ALU for the VPU consists of the following main blocks: 


1. 
EXPONENT COMPARE: compares exponents and increments the exponent of the smaller operand to 
equal that of the larger operand. 


2. 
DENORMALlZE: denormalizes the smaller operand's mantissa according to the shift determined by the 
Exponent Compare block. 


3. 
ALU: performs all the arithmetic and logic operations. 


4. 
SHIFT LEFT: renormalizes the output of the ALU block. 


5. 
EXPONENT ADJUST: adjusts the exponent due to renormalization of the mantissa. 


Block 1 applies only to floating-point operands and handles the high-order 8 bits. Block 2 also applies only to 
floating-point operands and manipulates the mantissa section. Barrel shifting for integers is accomplished in 
blocks 2 and 4. 


data inputs 


Data for the two operands ofthe ALU are selected from the following sources and bythe multiplexer select codes 
as shown in Table 8. 


ALUAMUX 
ALU B MUX 


SELAO 
OPERAND 
SOURCE 
SELAl 
OPERAND 
SOURCE 


L 
Multiplier output bus 
L 
ALU output bus 


H 
Register file port E 
H 
Register file port F 


DESTINATIONS 
ALU OUTPUT ROUTE DESCRIPTION 


Multiplier input A 
MULT A MUX and MULT A register 


Multiplier input A 
MULT B MUX and MULT B register 


ALU input A 
ALU B MUX and ALU B register 


Register file 
RFMUXBt 


Youtput 
Sum register and Y MUX 


t Feedthrough 
of ALU result via the register file is not allowed 
since direct 
feedback is available. 
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A INPUT 
••• 


B INPUT 
••• 


ALU FEEDBACK 


MULT FEEDBACK 


40 
40 


ALUB 
MUX 


P REG OUTPUT 
••• 


data conversion 
operation 


Data conversion 
operations 
are available 
in the ALU to convert 
data between 
the different 
formats 
used by the 
VPU (See Figure 5). However, 
the necessity 
of using the ALU to perform 
these functions 
impacts 
the available 
throughput 
of the vector 
processor 
and complicates 
microcode 
development. 
Therefore, 
two 
special 
data 
conversion 
blocks are located on the input and output buses ofthe device to perform the most commonly 
needed 
conversions 
and special 
output formatting: 


fixed-to-float 
conversion 
for inputs 
DA and DB. 


float-to-fixed 
conversion 
for the Youtput. 
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Conversions 
may occur in parallel 
with multiplier 
and ALU operations. 
Note that the output of the fixed-to-float 
block and the inputto 
the float-to-fixed 
block are in the internal 
data formats. 
The F, I, and J notations 
used stand 


for floating-point, 
integer, 
and fixed-point 
formats, 
respectively. 
Fixed-point 
formats 
include 
the format 
1.31 
which 
is also referred 
to as fractional. 


I-- 


I 


I 


I 


IL __ 


----vPU] 


I 


I 


I 


I 
--~ 


flxed-to-float block 


The fixed-to-float 
block reformats 
data in the input formats 
to the proper internal floating-point 
or integer format. 


It also allows 
conversion 
of integers 
or fractional 
numbers 
to the internal 
floating-point 
format. 
This block has 
a pipeline 
register 
that may be used to pipeline 
data or in flow-through 
mode as necessary. 
Use of this pipeline 
register 
is required 
for maximum 
speed 
for input cycles 
where 
conversion 
from integer 
or fractional 
numbers 
to floating-point 
numbers 
is performed. 
For loading unconverted 
integers 
or floating-point 
numbers, 
this register 
may be left in flow-through 
mode. The control 
signals 
for this block are shown 
in Tables 
10 and 11. 


FTDAT 
ENDAT 
DATA LATCH FUNCTION 


L 
L 
Latch data in register on RFCLK 


L 
H 
Recirculate data in register 


H 
L 
Bypass (but latch data on RFCLK) 


H 
H 
Bypass data register 


FFEN 
FMAT 
MNEMONIC 
OPERATION 


L 
L 
F2F 
External floating-point 
to internal floating-point 


L 
H 
121 
Integer to integer 


H 
L 
J2F 
Fractional to internal floating-point 


H 
H 
12F 
Integer to internal floating-point 


When the input data registers 
are enabled, 
the control 
signals 
FFEN and FMAT are latched 
along with the data. 


When FTDAT is high and ENDAT is low, the register will latch data on the RFCLK rising edge butthe 
flow-through 
data will be used internally. 
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Integers and floating-point numbers are reformatted to the internal format of the vector processor without any 
loss of precision. The reformatting for floating-point and integer values is as follows: 


Input: Single Precision floating-point, DEC F-Format. 
131 
161 15 114 
716 
01 
<---low 
mantissa---> 
sign <-€xponent-> 
<-high-> 
mantissa 


Internal: 
Single Precision floating-point. 


139 
321 31 130 
01 
<-€xponent-> 
sign 
<-high-><----Iow----> 
mantissa 
mantissa 


For example: 
(-+2.785 
E-11) 
Input: 
11011111 
Internal: 
01011101 
01110101 
10110110 
11011111 
00101110 
10110110 
11110101 
00000000 


Input: Signed Integer, 32 bit 2s complement. 
131 130 
01 
sign <-MSB---------------LSB-> 


Internal: 
Signed Integer. 


139 
321 31 130 
01 
<-unused-> 
sign <-MSB-----------,LSB-> 


For example: 
(-541184267) 
Input: 
11011111 
Internal: 
00000000 
11011111 
10111110 
10111110 
00101110 
00101110 
11110101 
11110101 


NOTE: 
This mode would also be used for loading fractional numbers, bit strings. unsigned integers. 
multiple precision. and other formats. 


Conversion from integer or fractional to floating-point numbers may also be performed in this block. Because 
of the extended precision internal floating-point format, accuracy is not lost in this conversion. This process 
consists of taking the 2s complement of input values, normalizing the result, generating the exponent, and 
packing these pieces into the proper internal floating-point format. The following four examples are given. 


Signed Integer, 32 bit 2s complement to floating-point 


For example: 
(-5.4 
E8) 
Input: 
11011111 
10111110 
00101110 
11110101 
Internal: 
10011110 
10000001 
00000111 
01000100 
00101100 


For example: 
(-+ 1.6 E9) 
Input: 
01011111 
10111110 
00101110 
11110101 
Internal: 
10011111 
00111111 
01111100 
01011101 
11101010 


TEXAS -1!1 
INSlRUMENlS 


SN74ACT8867 
32·81T VECTOR PROCESSOR UNIT 


Signed Fractional Number, 32 bit 2s complement to floating-point 


For example: (-0.25) 
Input 
11011111 
10111110 
00101110 
11110101 
Internal: 
01111111 
10000001 
00000111 
01000100 
00101100 


For example: (-+0.7) 
Input 
01011111 
10111110 
00101110 
11110101 
mternat 
10000000 
00111111 
01111100 
01011101 
11101010 


If a reserved floating-point operand is attempted to be input, a status bit will be set in the status register. A 
separate bit will show this condition for the DA and DB input buses. The reserved floating-point operand is not 
converted to a normal floating-point zero. This status is considered an error condition and is OR'ed into the 
common ERROR status bit (See Table 12). 


SIGNAL 
SELST1 
SELSTO 
STATUS RESULT 


COUT/U 
H 
L 
Reserved operand on DA input 


STEX 
H 
L 
Reserved operand on DB input 
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float-to-fixed block 


The float-to-fixed 
block allows conversion 
from the internal floating-pointformatto 
integer and fractional 
formats 
as well as allowing 
output 
of ordinary 
integer 
and floating-point 
numbers. 
It also allows, 
under 
control 
of the 
binary 
point adjust 
pins BPA4-BPAO, 
alignment 
of the radix point (See Table 13). 


BPA4-BPAO 
MNEMONIC 
FORMAT 
RANGE 


00000 
F2JOO 
0.32 
-<l.5" 
X < +0.5 


00001 
F2J02 
2.30 
-2El 
"X 
< +2El 


00010 
F2J04 
4.28 
-2E3 
" X < +2E3 


00011 
F2J06 
6.26 
-2E5 
" X < +2E5 


00100 
F2J08 
8.24 
-2E7 
" X < +2E7 


00101 
F2Jl0 
10.22 
-2E9 
" X < +2E9 


00110 
F2J12 
12.20 
-2Ell 
"X 
< +2Ell 


00111 
F2J14 
14.18 
-2E13 
" X < +2E13 


01000 
F2J16 
16.16 
-2E15 
" X < +2E15 


01001 
F2J18 
18.14 
-2E17" 
X < +2E17 


01010 
F2J20 
20.12 
-2E19" 
X < +2E19 


01011 
F2J22 
22.10 
-2E21 
"X 
< +2E21 


01100 
F2J24 
24.8 
-2E23 
" X < +2E23 


01101 
F2J26 
26.6 
-2E25 
" X < +2E25 


01110 
F2J28 
28.4 
-2E27 
" X < +2E27 


01111 
F2J30 
30.2 
-2E29 
" X < +2E29 


10000 
F21 
32.0 
-2E31 
"X 
< +2E31 


10001 
F2J 
1.31 
-2EO " X < +2EO 


10010 
F2F 
Pass Floating-Point 
-2E127" 
X < +2E127 


10011 
121 
Pass Integer 
-2E31 
" X < +2E31 


10100 
Reserved 
· 
· 
· 
· 
· 
· 
· 
· 
· 
· 
· 
· 
11101 
Reserved 


11110 
PROBEZ 
PROBEZ 
See Parameter Measurement 
Information Serial Scan Testing 


11111 
SCANZ 
SCANZ 
See Parameter Measurement 
Information Serial Scan Testing 


F. I, and J are notations for floating-point, 
integer, and fixed-point formats respectively; n.1ll represents 


'n' bits to the left, and 'm' bits to the right olthe binary point; all J formats are 2s complement numbers 
where the high order bit is the sign. 


The radix point alignment 
operation 
may be performed 
only on converted 
floating-point 
values. 
The conversion 
process 
produces 
2's complement 
format 
output from the internal 
floating-point 
format. 
Conversion 
of integers 
or other formats 
is not supported. 
The iriput to the float-to-fixed 
block is selected 
through 
the Y MUX according 
to Table 14. 


SELY 
INPUT SOURCE 


L 
Multiplier (Product Register) 


H 
ALU (Sum Register) 
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The BPA4-BPAOcontrol pins and the RND pin are latched in the BPA register under control of the FTBPA and 
the ENBPA pins according to Table 15. 


FTBPA 
ENBPA 
BPA LATCH FUNCTION 
l 
l 
Latch BPA in register on ClK 
l 
H 
Recirculate BPA in register 


H 
l 
Bypass (but latch BPA on ClK) 


H 
H 
BypassBPA 
register 


The value on BPA4-BPAO sets the integer field size of the fixed-point format 
(Le. for 8.24 format, 


BPA4-BPAO= 4 hex). The internal floating-point value is converted into this fixed-point representations. 
Rounding or truncation may be performed during this process. The following two examples are given for both 
a without rounding condition and a with rounding condition. 


Without 
rounding: 


Internal: 
10000011 
00110001 
00000000 
10110110 
00111100 


(floating-point) 


BPA Fmt. 
Output: 
0.32 
xxxxxxxx 
overflow 
xxxxxxxx 


Output: 
2.30 
xxxxxxxx 
overflow 
xxxxxxxx 


Output: 
4.28 
01011000 
10000000 
01011011 
00011110 


Output: 
6.26 
00010110 
00100000 
00010110 
11000111 


Output: 
8.24 
00000101 
10001000 
00000101 
10110001 


etc. 


With rounding: 
Internal: 
10000011 
00110001 
00000000 
10110110 
00111100 


(floating-point) 


BPA Fmt. 
Output: 
0.32 
xxxxxxxx 
overflow 
xxxxxxxx 
Output: 
2.30 
xxxxxxxx 
overflow 
xxxxxxxx 
Output: 
4.28 
01011000 
10000000 
01011011 
00011110 
Output: 
6.26 
00010110 
00100000 
00010110 
11001000 


Output: 
8.24 
00000101 
10001000 
00000101 
10110010 


etc. 
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Rounding and trucation for data conversion is completed in the float-to-fixed block. Floating-point and integer 
values are reformatted for output as shown below: 


Internal: Single Precision floating-point. 
139 
321 31 130 
01 
<-exponent-> 
sign <-high-><-----,Iow-----> 
manttissa 
mantissa 


Output: Single Precision floating-point, DEe F-Format. 
131 
161 15 114 
716 
01 
<---low 
mantissa---> 
sign <-exponent-> 
<-high-> 
mantissa 


For example: 
Internal: 
01011101 
Output: 
(truncated) 
Output: 
(rounded) 


01110101 
11011111 
11011111 


11011111 
10110110 
10110111 


10110110 
00101110 
00101110 


10001100 
11110101 
11110101 


Internal: Signed Integer. 
139 
321 31 130 
01 
<-unused-> 
sign <-MSB-----------LSB-> 


Output: Signed Integer, 32 bit 2s complement. 
131 130.................................•................................................... 
01 
sign <-MSB---------------LSB-> 


For example: (no rounding is done for integers) 
Internal: 
00000000 
11011111 
10111110 
00101110 
Output: 
11011111 
10111110 
00101110 
11110101 
11110101 


NOTE: 
This mode would also be used for outpulling 
fractional 
numbers, 
bit strings, unsigned 
integers, multiple precision, and other formats. 


SIGNAL 
SELST1 SELSTO 
STATUS RESULT 


N 
H 
L 
Float-to-fixed 
output sign 


Z 
H 
L 
Float·to-fixed 
zero result 


OVER 
H 
L 
Float-to-fixed 
overflow 


After data and instructions are latched in the ALU or multiplier input registers, the status is available one clock 
cycle later including the propagation delay through the status register. The conversion process can overflow 
when large numbers are represented in a too small fixed-point format or from the rounding operation. For 
overflow, the sign bit indicates the direction of overflow. Values that are very small; i.e., zero exponent but 
mantissa bits not all zero, as well as reserved floating-point operands, are output as true zeroes (sign bit and 
mantissa bits are all zero). 
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The status 
bits are generated 
from the following 
equations: 


SIGN 
B31 = 1 
* 
NOT(BPA<H'12' 
* SHIFT>32 
* B31=1 * NOT(EXPO» 
* 
NOT(BPA<H'12' 
* SHIFT =32 * B31 =1 * ROUND=1 
* M31 Z) 


The SIGN bit is the same as the sign ofthe 
internal 
value except for formats 
where a negative 
value is too small 
to be represented 
or rounds 
to zero. 


ZERO 
= 
(BPA=H'13' 
* B31=0 * M31Z) 
+ 
(BPA<H'13' 
* EXPO) 
+ 
(BPA<H'12' 
* SHIFT>32) 
+ 
(BPA<H'12' 
* SHIFT=32 
* B31=0 * ROUND=O) 
+ 
(BPA<H'12' 
* SHIFT=32 
* B31=1 * ROUND=1 
* M31Z) 


The 
ZERO 
bit is set when 
an integer 
zero, 
a floating-point 
zero, 
or a reserved 
operand 
is input, 
or the 
floating-point 
value 
is too small to be represented 
in the selected 
format. 


OVER 
= 
(BPA=H'12' 
* EXP1 * ROUND=1 
* M24) 
+ 
(BPA<H'12' 
* SHIFT <0) 
+ 
(BPA<H'12' 
* SHIFT=1 
* B31=0 * ROUND=1 
* M31) 
+ 
(BPA<H'12' 
* SHIFT=O * B31=0) 
+ 
(BPA<H'12'*SHIFT=0*B31=1 
* NOT(M31Z» 


The OVER 
bit is set when 
a floating-point 
number 
is rounded 
and overflows 
or the value 
is too large to be 
represented 
in the selected 
format. 
Remember 
that the most significant 
bit is the sign bit for fixed-point 
output 
formats. 


B31 
EXP1 
EXPO 
M31 
M31Z 
M24 


value on the binary 
point adjust 
pins (4-0) 
128 - exponent 
+ (2 * BPA4-BPAO) 
(for BPA < 11 hex) 
or 128 - exponent 
+ 1 (for BPA = 11 hex) 
sign bit (bit 31) 
bits of the exponent 
(39-32) 
are all 1 
bits of the exponent 
(39-32) 
are all 0 
bits of the mantissa 
(30-00) 
are all 1 
bits of the mantissa 
(30-00) 
are all 0 
bits of the mantissa 
(30-07) 
are all 1 


BPA 
SHIFT 


rounding and truncation 


If selected, 
the float-to-fixed 
block performs 
rounding 
on internal 
floating-point 
values 
when they are output 
as 
standard 
DEC 
F floating-point 
or 2s complement 
fixed-point 
format. 
The fixed-point 
output 
format 
1Jsed is 
selected 
through 
the BPA pins. The rounding 
method 
is standard 
rounding 
procedure 
withe a one in the round 
bit rounding 
to the next higher value. For floating-point 
values 
output in DEC F floating-point 
format, 
this results 
in a round toward 
plus or minus infinity 
depending 
on the sign bit. The truncation 
is toward 
zero for this case. 
For floating-point 
values converted 
to fixed-point 
values, the rounding 
is similar whereas 
the truncation 
operation 
is toward 
minus infinity. The truncation 
performed 
in the ALU for floating-point 
to integer format and floating-point 
to fractional 
format 
is different. 
For these instructions, 
truncation 
is toward 
zero because 
the floating-point 
input 
is first 
shifted 
and 
truncated, 
and then 
2s complemented 
to create 
the fixed-point 
representation. 
In the 
float-to-fixed 
block, the truncation 
is performed 
after the 2s complement 
because 
of implementation 
restrictions. 
Figures 
5, 6, and 7 illustrate 
these 
differences 
along with an example. 
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FLT(1.6) 
=> INT(2) 
FLT(1.3) 
=> INT(1) 
FLT(-.8) 
=> INT(-1) 
FLT(-.4) => INT(O) 


-2 
-1 
0 
2 
. . I· 
· 


FLT(1.6) 
=> INT(1) 
FLT(1.3) 
=> INT(1) 


FLT(-.8) 
=> INT(-1) 


FLT(-.4) 
=> INT(-1) 


-2 
-1 
0 
2 
I • I • 
• 
• 
+-- +-- +-- +-- 


(Internal 
floating-poInt 
to DEe floating-poInt) 


-2 
-1 
0 
2 
. . I· 
· 
+-- +-- +-- +-- 
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Example: 
FLT(1.6) 
=> INT(1) 
FLT(1.3) 
=> INT(1) 
FLT(-.8) 
=> INT(O) 
FLT(-.4) 
=> INT(O) 


-2 
-1 
0 
2 
I· 
· I· 
· 


(Internal 
floating-point 
to DEC floating-point) 


-2 
-1 
0 
2 


Figure 7. ALU Truncation 


Rounding 
is performed 
when the RND signal is high and is not performed 
when the RND signal is low. The RND 
input 
is acknowledge 
by the 
BPA register. 
No rounding 
is performed 
when 
outputting 
integer 
or fractional 
numbers. 
The rounding 
operation 
may overflow 
and this is flagged 
in the appropriate 
bit of the status 
register. 


No other 
rounding 
is performed 
anywhere 
in the device. 
All multiplier 
and ALU 
results 
are truncated, 
which 
prevents 
double 
rounding 
and allows 
multiple-precision 
operations 
to be supported. 


flow-through 
and pipeline 
mode 


The data registers 
in the Vector 
Processing 
Unit (VPU) have been configured 
so that they may be enabled 
in 
pipeline 
mode or disabled 
in flow-through 
mode to match the supporting 
hardware 
and software 
configuration. 
Each register 
or closely 
related group of registers 
has its own independent 
controls 
to allow maximum 
flexibility. 
The register 
file uses special 
settings 
of its existing 
control 
pins to activate 
a flow-through 
type mode through 
this block. The entire device 
may be used in flow-through 
mode, pipelined 
mode, or separate 
sections 
may be 
independently 
controlled 
as dictated 
by the application 
requirements. 


The control 
pins available 
on the VPU and the associated 
registers 
they control 
are itemized 
in Table 17. 


REGISTER 
ENABLE 
FLOW-THROUGH 
CLOCK 


Fixed-to-float 
A and B 
ENOAT 
FTOAT 
RFClK 


Register File 
WEA,WEB 
WEA, WEB, and special address 
RFClK 
(46 and 47 decimal) 


Mult MUX A and B 
ENM 
FTMA 
ClK 


Multiplier Instruction 
ENM 
FTI 
ClK 


AlUAand 
B 
ENA 
FTMA 
ClK 


AlU 
Instruction 
ENA 
FTI 
ClK 


Product 
ENP 
FTPS 
ClK 


Sum 
ENS 
FTPS 
ClK 


Binary Point Adjust and Round Mode 
ENBPA 
FTBPA 
ClK 


Status 
- 
FTST 
ClK 
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_ •. - An fTow-ftfrough 
controls 
are active 
high with the exception 
of the register 
file flow-through 
mode, which 
does 


not use a separate, 
explicit 
flow-through 
control 
pin. When 
a flow-through 
control 
is high, the corresponding 


register 
will be bypassed. 
The data on the input will appear at the output. 
In this mode, the enable will still control 


whether 
the bypassed 
register 
is updated 
on a clock 
pulse or not. When 
the flow-through 
control 
is low, the 


register 
will act as a normal 
register 
and the output will be the registered 
data. 


All enables 
for this device 
are active 
low. When 
an enable 
is low, a rising edge on the appropriate 
clock signal 


(CLK or RFCLK) 
will latch the data into the register. While an enable 
is high, data in the register 
will be retained 


regardless 
of the clock signal or flow-through 
controls. 
The enables 
for the register 
file work in conjunction 
with 


the address 
controls 
to select the correct register to latch the input data. The write address 
and write enable pins 


on the register 
file (AS-AD and BS-BD, WEA and WEB, 
respectively) 
are always 
latched 
on the rising edge of 
RFCLK. 


The 
arrangement 
of controls 
has been 
made 
to allow 
maximum 
flexibility 
with a minimum 
number 
of pins. 
Instructions 
are typically 
latched 
with the data values 
but may be put in the flow-through 
mode independently. 
This allows an external 
pipeline 
register to be employed 
in those cases where even further flexibility 
is required. 
The binary-point 
adjust format 
and rounding 
mode may be latched 
if they do not need to be changed 
frequently. 
Data flow may also be controlled 
through 
the registers 
and feedback 
paths. Data may be stalled 
at the multiplier 


or ALU inputs independently 
or held at the Y output while other operations 
continue 
with output to the register 


file. With the multiplier 
and ALU data latches 
in the flow-through 
mode, the computational 
blocks of the VPU may 


even be configured 
as a flow-through 
multiplier 
accumulator 
with latched 
or flow-through 
output. 
The status 


register 
may be selected 
for the registered 
or the feedthrough 
mode as desired. 


status 


Status for the 'ACT8867 
is generated 
for error conditions, 
calculation 
results, and conversion 
operations. 
Status 


is latched 
on the rising 
edge 
of the CLK signal 
and will be available 
from 
the status 
register 
on the cycle 


subsequent 
to the operation 
where 
it is generated. 
All status 
bits are latched 
on every rising edge of the CLK 


signal with the exception 
ofthe 
Initial Error Source 
(IES) status bits. These bits are specially 
enabled 
by an error 


condition 
as described 
in Table 
18. The flow-through 
control 
on the status 
register 
may be used for special 


applications 
where 
the status 
is needed 
as soon as it is generated. 
An output enable 
control 
is available 
to set 


the status output 
pins to high-impedance 
state as needed. 


Both floating-point 
and integer 
operations 
produce 
status 
as appropriate 
for their operation 
with this status 
generated 
primarily 
by three main blocks: the multiplier, 
the ALU, and the float-to-fixed 
blocks. The status from 


each location 
is selected 
for output through 
the six status pins by the SELST1-SELSTD 
controls 
(see Table 18). 


SELST1 
SELSTO 
STATUS REGISTER 
SOURCE 


L 
L 
MUltiplier 


L 
H 
ALU 


H 
L 
Float-to-fixed 


H 
H 
Initial error 


The 
ERROR 
pin is set when 
an overflow 
or underflow 
has occurred, 
a divide-by-zero 
is attempted 
in the 


multiplier, 
or a reserved 
floating-point 
operand 
is input to the device. 
This status 
signal 
may not be valid for 
double-precision 
algorithms 
using 
the 
single-precision 
integer 
operations 
in the 
multiplier 
and 
ALU 
since 


overflows 
of the low-order 
pieces will be added to the high-order 
pieces 
in subsequent 
operations. 
When 
this 


signal becomes 
high, it enables 
the six IES status register 
locations 
to latch on the next rising edge of the CLK 


signal the status bits marked with a dagger (t) in Table 19, except for the NaN operand 
status. These status bits 


will then be held until the status source 
of the initial error is selected 
and released. 
After exiting from this state, 


these 
registers 
are enabled 
to accept 
status from new master 
errors . 


. 
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STATUS PIN 
SELST1 
SELSTO 
STATUS RESULT 


L 
L 
Set when any error condition occurs in 'ACT886? 


ERROR 
L 
H 
Set when any error condition occurs in 'ACT886? 


H 
L 
Set when any error condition occurs in 'ACT886? 


H 
H 
IES multiplier underflow 


L 
L 
Multiplier underflow t 


COUT/U 
L 
H 
ALU underflow(floating-point 
operation) or carray out bit(interger operation) t 


H 
L 
NaN on DA inputt 


H 
H 
IES ALU underflow 


L 
L 
Multiplier result sign 


N 
L 
H 
ALU result sign 
. 


H 
L 
Float~to-fixed output sign 


H 
H 
IES multiplier overflow 


L 
L 
Multiplier zero result 


Z 
L 
H 
ALU zero result 


H 
L 
Float-to-fixed 
zero result 


H 
H 
IES ALU overflow 


L 
L 
Multiplier overflow t 


OVER 
L 
H 
ALU overflowt 


H 
L 
Float-to-fixed 
overflowt 


H 
H 
IES float-to-fixed 
overflow 


L 
L 
Divide by zero in multipliert 


STEX 
L 
H 
ALU link flip flop 


H 
L 
NaN on DB inputt 


H 
H 
IES divide by zero 


t The ERROR pin is theresult of the OR of all status bits. Only ALU floating-point 
underflow will set this bit, not the carry-out 
on integer operations. 


The multiplier 
and ALU both generate 
status on the result offloating-point 
and integer operations. 
The sign bit, N, 
will be set if the result is negative. 
The zero bit, Z, will be set ifthe result is exactly zero. Underflow 
will be signaled 
for floating-point 
operations 
when the result is too small to be represented 
in a normalized 
format. 
Overflow 
will 
occur for floating-point 
and integer 
results 
that are too large to be represented 
in the output format. 
Unsigned 
integer overflow 
for the ALU is shown 
by the COUT /U bit. The overflow 
flag will need to be ignored 
in microcode 
for special 
cases 
involving 
double-precision 
operations. 
The Instruction 
Set section 
has further 
information 
on 
multiplier 
operations 
and ALU operations. 


The conversion 
blocks 
fixed-to-float 
and float-to-fixed 
generate 
several 
special 
status 
bits. The fixed-to-float 
blocks signal the input of a NaN value with no other status generated. 
The float-to-fixed 
block generates 
status 
on the sign and zero value ofthe output and overflow. The overflow 
bit will be set when converting 
a floating-point 
number 
to too small of a fixed-point 
format 
or when the rounder 
causes 
an overflow, 


integer status 


Integer status is generated 
from the multiplier 
and the ALU and is shown on four status pins, COUT/U, 
N, Z, and 
OVER. 


Multiplier 
instructions 
operate 
on both 2s complement 
and unsigned 
integers 
and report integer 
status 
on the 
N, Z, and OVER 
signals. 
Underflow 
cannot 
occur for integer 
operations 
in the mUltiplier. 
In single-precision 
multiplication 
the product 
may be up to 64 bits in length. If it is desired to keep a 32-bit product, 
then any non-zero 
bits in bits 32 to 63 indicate 
the result has overflowed. 
Double-precision 
operations 
in the multiplier 
will retain 
all 64 bits. For example, 
a 32-bit by 32-bit multiplication 
has no overflow. 
The low-order 
multiply 
will appear 
to 
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overflow 
since the multiplier 
assumes 
a 32-bit result is wanted 
but this bit should be ignored 
in microcode 
at this 


step. 
The 
ZERO 
status 
should 
also 
be 
carefully 
interpreted 
since 
both 
parts 
must 
be 
zero 
before 
a 


double-precision 
result can be determined 
to be exactly 
zero. 


The ALU reports 
integer 
status on all four of the status pins. Unsigned 
operations 
use the Z and COUT/U 
bits. 
The zero bit is set during 
a subtraction 
when 
both operands 
are equal. The COUT/U 
bit is set during 
addition 


indicating 
an overflow. 
The COUT/U 
bit goes to zero during subtraction 
if the result underflows. 
2s complement 


operations 
use the Z, N, and OVER 
status 
bits. The Z bit is set when a 2s complement 
operation 
produces 
a 


zero result. Together the N (sign) bit and the OVER bit denote overflow. 
If the overflow 
is in the positive 
direction, 


then OVER 
is high and N is high. If it is negative, 
then OVER 
is high and N is low. The underflow 
bit is not used 


for 2s complement 
integer 
operations. 
For the multiplier, 
double-precision 
operations 
in the ALU can generate 


invalid overflow 
and zero status. The intermediate 
steps can generate 
valid carries from the MSB that get stored 


in carry flip-flops 
but will signal2s 
complement 
single-precision 
overflow. 
For unsigned 
integers, 
both underflow 


and overflow 
could be incorrectly 
signaled. 
The final step ofthe double-precision 
algorithm 
will give a valid status. 
The microcode 
will need to handle 
these 
cases appropriately. 


The float-to-fixed 
block generates 
only Nand Z status bits for integer operations 
because 
there is no conversions 


done on integer 
operands 
and rounding 
cannot 
occur. 


floating-point 
status 


Floating-point 
status 
is generated 
from the multiplier, 
ALU, and float-to-fixed 
block; and is shown 
on the four 


status 
pins: COUT/U, 
N, Z, and OVER. 


The ALU and multiplier 
report status for floating-point 
operations 
similarly. 
When 
the result of a floating-point 


operation 
is too small to be represented 
in a normalized 
format, 
the underflow 
flag (COUT/U) 
is asserted 
and 


the result is forced to zero. When the exponent 
of the result is larger than 127 decimal 
(unbiased), 
the overflow 


flag (OVER) 
is asserted 
and the result is undetermined. 
The N (sign) bit is a duplicate 
of the sign of the result. 


and has additional 
meaning 
for a compare 
operation. 


The zero (Z) bit is set when the result is zero. However, 
internal 
to the VPU, a floating-point 
zero is determined 


by an exponent 
field of all zeroes, 
even if some mantissa 
bits are nonzero. 
During a PASS instruction, 
operands 


with zero mantissas 
cause zero status to be signalled, 
even ifthe exponent 
is nonzero. 
Conversely, 
an operand 


with zero exponent 
and nonzero 
mantissa 
is passed 
without 
signalling 
zero status, 
even though 
the operand 


will be forced 
to zero on output. 


The float-to-fixed 
block 
converts 
floating-point 
operands 
to fixed-point 
values 
and generates 
status 
for this 
operation. 
The floating-point 
value, 
however, 
may be too big or too small to be represented 
in the selected 


fixed-point 
format. 
When the value is too large, the overflow 
(OVER) 
status 
bit will be set and the output value 


will be undefined. 
When 
a floating-point 
value is too small to show any nonzero 
bits in the fixed-point 
result, 
a 


zero will be output. 
The underflow 
flag is not asserted 
by the float-to-fixed 
block. Only the zero flag will be set 


for this situation. 
The rounding 
section of the float-to-fixed 
block may have a carry out of the MSB of the mantissa 
when outputting 
floating-point 
numbers. 
If so, a one bit normalization 
is required 
resulting 
in an increment 
of the 
exponent. 
If the exponent 
is incremented 
beyond 
127 decimal 
(unbiased) 
an overflow 
will occur and OVER will 
be set. The SIGN and ZERO bits are set appropriately 
for floating-point 
format and converted 
fixed-point 
output. 


integer 
compare 


The ALU ihstruction 
set does not include 
an integer compare 
instruction. 
However, 
the ALU status reported 
for 
the integer subtraction 
instruction 
(A + B + CIN), with CIN pin being set high provides 
the information 
necessary 
to compare 
the magnitude 
of two unsigned 
or 2s complement 
numbers. 
This instruction 
will set the N (sign), 
zero (Z), and overflow 
(OVER) status bits. The interpretation 
of the status bits required 
to determine 
the relation 
of the two operands 
is given in Table 20. 


TEXAS .J!1 
INSTRUMENTS 


SN74ACT8867 
32·BIT VECTOR PROCESSOR UNIT 


UNSIGNED 
NUMBERS 
25 COMPLEMENT 
NUMBERS 
RELATION 
STATUS BITS 
STATE 
STATUS 
STATE 


AEQB 
Z 
HIGH 
Z 
HIGH 


ANEB 
Z 
LOW 
Z 
LOW 


AGEB 
COUT/U 
HIGH 
NXNOROVER 
HIGH 


ALTB 
COUT/U 
LOW 
NXOROVER 
HIGH 


AGTB 
COUT/U AND (NOT Z) 
HIGH 
(N XNOR OVER) AND (NOT Z) 
HIGH 


ALEB 
(NOT COUTIU) OR Z 
HIGH 
(N XOR OVER) OR Z 
HIGH 


floating-point 
compare 


The status 
from 
the compare 
A and 
B instruction 
(eMP) 
is generated 
by the ALU to provide 
a method 
for 


determining 
the relationship 
of two floating-point 
values. 
The normal floating-point 
subtract 
operation, 
because 


of underflow 
and overflow, 
cannot 
generate 
complete 
status for all cases. 
The compare 
operation 
will set the 


status correctly 
for the values 
of the operands. 
The underflow 
and overflow 
bits are never set by this instruction 


since the operation 
being 
performed 
is a compare 
and not a true subtraction. 
The instruction 
generates 
the 


N (sign) and Z (zero) bits, which 
are interpreted 
in a similar 
manner 
to the integer compare 
above to determine 


the relationship 
of the numbers. 
No meaningful 
data is output 
when this operation 
is executed. 


The algorithm 
used by the compare 
instruction 
first compares 
the signs of the two operands. 
If the signs are the 


same, then a subtraction 
will be performed. 
Otherwise 
the status bits can be determined 
directly. Table 21 shows 


the complete 
status generation 
algorithm. 
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SIGNA 
SIGN B 
OPERATION 
ALGORITHM 


OVER is low 
IF result < 0 THEN N is high (A LT B) 
- 
- 
IAI-IBI 
ELSE N is low (A GE B) 
IF result = 0 THEN Z is high (A EO B) 
ELSE Z is low (A NE B) 


IAI 
+ IBI 
OVER is low; N is low; Z is low; 
+ 
- 
result is always> 
0 (A GT B) 


OVER is low 
IF result < 0 THEN N is high (A LT B) 


+ 
+ 
IAI-IBI 
ELSE N is low (A GE B) 
IF result = 0 THEN Z is high (A EO B) 
ELSE Z is low (A NE B) 


-IAI 
±IBI 
OVER is low; N is high; Z is low; 
- 
+ 
result is always < 0 (A LT B) 


FLOATING-POINT 
NUMBER 
RELATION 
STATUSt 
STATE 


AEOB 
Z 
HIGH 


ANEB 
Z 
LOW 


AGEB 
N XNOR OVER 
HIGH 


ALTB 
NXOROVER 
HIGH 


AGTB 
(N XNOR OVER) AND (NOT Z) 
HIGH 


ALE 
B 
(N XOR OVER) OR Z 
HIGH 


t OVER (and COUT/U) will always be low. It has been included in the 
above equations 
to provide compatibility 
between these and the 2s 
complement 
relationships. 


data and system 
Integrity 
output parity 


Even parity is generated 
from the final result of the float-to-fixed 
block and presented 
externally 
at the output 
PY. When 
output 
enable 
OEY is high, this output 
is in the high-impedance 
state. When 
OEY is low, it is in the 
normal 
output 
mode. 


test pins 


Two test pins on the VPU control 
the device 
test modes 
and are given in Table 23. 


TP1 
TPO 
OPERATION 


L 
L 
Outputs driven low 


L 
H 
Outputs driven high 


H 
L 
Outputs driven high-impedance 


H 
H 
Normal device operation 


The primary 
purpose 
of the test pins is for testability 
requirements 
by Texas 
Instruments. 
They are useful for 
fault-tolerant 
systems 
to allow a malfunctioning 
part to be effectively 
removed 
from the system 
by forcing 
all its 
outputs 
to a high-impedance 
state. 
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absolute maximum ratings over operating free-air temperature range (unless otherwise noted)t 


Supply voltage, Vcc(see Note 1) 
- 0.5 V to 7 V 
Input clamp current, 11K (VI < 0 or VI > Vccl 
± 20 mA 
Output clamp current, 10K (Va < 0 or Va> 
Vccl 
± 50 mA 
Continuous ouput current, la (Va = Vccl 
± 50 mA 
Continuous current through VCC or GND pins.. 
. . 
. . . . . . .. . . . .. . . . . . . .. . . . . . . . . . 
. . . .. 
±100 mA 
Operating free-air temperature range 
O°Cto 70°C 
Storage temperature range 
- 65°C to 150°C 


t Stresses beyond those listed under "absolute maximum ratings" may cause permanent damage to the device. These are stress ratings only and 
functional 
operation of the device at these or any other conditions 
beyond those indicated under "recommended 
operating conditions· 
is not 


implied, Exposure to absolute-maximum-rated 
conditions for extended periods may affect device reliability, 


NOTE 1: All voltage levels are with respect to GND. 


MIN 
NOM 
MAX 
UNIT 


VCC 
Supply voltage 
4,75 
5 
5,25 
V 


VIH 
High-level input voltage 
2 
VCC 
V 


VIL 
Low-level input voltage 
-0.3 
0.8 
V 


IOH 
High-level output current 
-8 
mA 


IOL 
Low-level output current 
8 
mA 


TA 
Operating free-air temperature 
0 
70 
·c 


electrical characteristics over recommended operating free-air temperature 
range (unless 


otherwise noted) 


PARAMETER 
TEST CONDITIONS 
MIN 
TYP 
MAX 
UNIT 


VCC =5,25V. 
10H =-8 
mA 
4.26 


VOH 
High-level output voltage 
V 
VCC = 4.75 V. 
10H =-8 
mA 
3.76 


VCC = 5,25 V, 
10L= 8 mA 
0.45 
VOL 
Low-level output voltage 
V 
Vcc 
= 4.75 V, 
10L= 8 mA 
0.45 
I All Others 
VCC = 5.25 V. 
VI = VCC to 0 
%5 
!JA 
II 
Input current 
I TPO &TP1 
VCC = 5,25 V, 
VI = VCC to 0 
%25 
!JA 


IZ 
I/O or output off-state current 
VCC = 5,25 V, 
VI = VCC to O. 
I/O or output off 
%10 
!JA 


ICCQ 
Quiescent supply current 
VCC = 5,25 V. 
VI = 0.2 V to VCC -0,2 
V 
1 
mA 


ICC+ 
Operating supply current 
VCC= 
5.25V. 
VI = 0.2 V to VCC -0.2 
V 
10 =0 
200 
mA 


Ci 
Input capacitance 
10 
pF 


; ICC is measured with maximum clock frequency equivalent to 16 MHz. Inputs should be presented with random logic highs and lows to assure 


the toggling of internal nodes. Outputs are presented with capacitive loads of 50 pF. 
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switching characteristics over recommended ranges of supply voltage and operating free-air 
temperature (unless otherwise noted) 


PARAMETER 
MIN 
TYP! 
MAX 
UNIT 


t01 
Propagation 
delay, clock to producVsum registers to Y outputs, all BPA cases 
30 
45 
ns 


tp2 
Propagation 
delay, clock to MUlT/AlU 
registers io status outputs with FTST high 
42 
58 
ns 


t03 
Propagation 
delay, clock to status registers to status outputs with FTST low 
16 
24 
ns 


Propagation 
delay, clock to MUlT/AlU 
registers and flow-through 
to Y bus with 
44 
70 
ns 
tp4 
FTMA = ENM = ENA = low and FTPS = FTBPA = high 


tp5 
Propagation 
delay, clock to register file and flow-through 
to Y bus (read same location written) 


50 
80 
with WEA = WEB = low and FTMA = FTPS = FTBPA = high 
ns 


Propagation delay, clock to DNDB 
registers and flow-through 
to Y bus, with 


tp6 
WEA = WEB = high, RA = 2E hex, RB = 2F hex, FTMA = FTPS = FTBPA = high, and 
50 
80 
ns 
fixed-to-float 
inactive 


Propagation 
delay, clock to DNDB 
registers and flow-through 
to Y bus, with 


tp7 
WEA = WEB = high, RA = 2E hex, RB = 2F hex, FTMA = FTPS = FTBPA = high, and 
60 
gO 
ns 
fixed-to-float 
active 


tp8 


Propagation 
delay, DNDB 
inputs to Y bus, with WEA = WEB = high, RA = 2E hex, RB = 2F hex, 


and FTDAT = FTMA = FTPS = FTBPA = high 
55 
85 
ns 


tpg 
Propagation 
delay, SElY to Y bus, float-to-fixed 
in pass mode 
20 
30 
ns 


tol0 
Propagation 
delay, ClK 
and SElY to Y bus, all BPA cases, with FTPS = ENP = ENS = low 
25 
42 
ns 


tp11 


Propagation 
delay, SElY to Y bus, all BPA cases including float-to-fixed 
conversion with binary 


28 
37 
point adjust, with FTPS = ENP = ENS = low 
ns 


t012 
Propagation 
delay, BPA to Y bus, all BPA cases with FTBPA high 
30 
45 
ns 


t013 
Propagation 
delay, MI and AI instruction bits to Y bus with FTPS high 
30 
62 
ns 


tp14 
Propagation 
delay, read address ports C, D, E, or F to Y bus, with FTMA = FTPS = high 
48 
75 
ns 


tp15 


Propagation delay, read address ports C, D, E, or F to Y bus through seed ROM, with 
55 
85 
FTMA = FTPS = high 
ns 


t016 
Propagation 
delay, SElST 
to status output 
15 
25 
ns 


ten1 
Enable time, OEY to Y data outputs 
12 
20 
ns 


ten2 
Enable time, OES to status outputs 
12 
20 
ns 


tdis1 
Disable time, OEY to Y data outputs 
12 
20 
ns 


tdis2 
Disable time, OES to status outputs 
12 
20 
ns 
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temperature 


PARAMETER 
MIN 
MAX 
UNIT 


tc1 
Cycle time, ClKT 
62 
ns 


tc2 
Cycle time, RFClK* 
31 
ns 


tw1 
Pulse duration, ClK 
high 
10 
ns 


tw2 
Pulse duration, ClK 
low 
12 
ns 


tW3 
Pulse duration, RFClK 
high 
10 
ns 


tW4 
Pulse duration, RFClK 
low 
12 
ns 


tsu1 
Setup time, DNDB data bus input before RFClK 
10 
ns 


tsu2 
Setup time, register file write address NB port before RFClK 
12 
ns 


tsu3 
Setup time, register file enables WEA/WEB 
before RFClK 
10 
ns 


tsu4 
Setup time, DNDB 
register enables before RFClK 
10 
ns 


tsu5 
Setup time, register enables (ENM, ENP, ENS, and ENA) before ClK 
10 
ns 


tsu6 
Setup time, SElM, 
SELA before ClK 
with ENM = ENA = low 
10 
ns 


tsu7 
Setup time, instruction bits MIIAI before ClK with ENM - ENA - low 
10 
ns 


tsuB 
Setup time, binary point adjust BPA before ClK with ENBPA low 
10 
ns 


tsu9 
Setup time, select status SElST 
before ClK 
12 
ns 


tsu10 
Setup time, round input RND before ClK 
with ENBPA low 
10 
ns 


tsu11 
Setup time, carry input CIN before ClK with ENA low 
10 
ns 


tsu12 
Setup time, select register file MUX SElRFNB 
before RFClK 
10 
ns 


tsu13 
Setup time, fixed-to-float 
enable FFEN before RFClK 
10 
ns 


tsu14 
Setup time, fixed-to-float 
format FMAT before RFClK 
10 
ns 


th1 
Hold time, DNDB data bus input after RFClK 
6 
ns 


th2 
Hold time, register file write address NB port after RFClK 
2 
ns 


th3 
Hold time, register file enables WEA/WEB 
after RFClK 
2 
ns 


tM 
Hold time, DNDB 
register enables after RFClK 
2 
ns 


th5 
Hold time, register enables (ENM, ENP, ENS, and ENA) after ClK 
2 
ns 


th6 
Hold time, SElM, 
SELA after ClK with ENM = ENA = low 
4 
ns 


th7 
Hold time, instruction 
bits MI/AI after ClK with ENM = ENA = low 
6 
ns 


thB 
Hold time, binary point adjust BPA after ClK with ENBPA low 
6 
ns 


th9 
Hold time, select status SElST 
after ClK 
6 
ns 


th10 
Hold time, round input RND after ClK with ENBPA low 
6 
. 
ns 


th11 
Hold time, carry input CIN after ClK with ENA low 
6 
ns 


th12 
Hold time, select register file MUX SElRFNB 
after RFClK 
2 
ns 


th13 
Hold time, fixed-to-float 
enable FFEN after RFClK 
5 
ns 


th14 
Hold time, fixed-to-float 
format FMAT after RFClK 
5 
ns 
t Fully pipelined, rising edge to rising edge 
* Permits two write cycles to register file during one ClK 
period (dOUble-pumping) 
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Supply Current 
vs 
Frequency 


Vcc 
= S V 
TA=2SoC 


c( 
i 
~ 
"u 
-a 
30 
a. 
" 
lI? 
u 
20 
U 


serial scan testing mechanism 


Texas Instruments provides the scan-testing hardware features described here. The hardware for scan testing 
included on the 'ACT8867 is designed to reduce the enormous problem of fault detection for the whole device 
to several more manageable fault detection tasks. These tasks are divided amoung the register file, the multiplier 
core, the ALU core, the fixed-to-float logic, and the float-to-fixed logic as isolatable sections. 


Features of Scan Test Mechanism: 


1. 
Capability to serially load and serially dump all registers except the instruction register, status register, and 
the BPA register for development and improvement of patterns to a high grade level. Because of the 
prohibitive hardware complexity that would result, the register file is also excluded from scanning. 


2. 
Probe testing with contact reqUired to less than 100 of the 208 device pins. 


3. 
Three serially loadable registers to drive input buses: DAFMT, DBFMT, RFADDR 


4. 
One serial output register to read Y Bus: YSCAN 


5. 
Minimal interaction between critical delay path circuits and test circuitry 


6. 
Ten total serial input and output scan paths with a maximum of 40 bits each 


The serial scan chains are implemented as illustrated in Table 24. 
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REGISTER 
SERIAL INPUT 
SERIAL OUTPUT 
NUMBER OF FFs 


MULTA 
SELMO 
ERROR 
40 


MULT B 
SELMl 
COUT/U 
40 


ALUA 
SELAO 
N 
40 


ALU B 
SELAl 
Z 
40 


PRODUCT 
SELM3 
OVER 
40 


SUM 
CIN 
STEX 
40 


RFADDR 
A5 
Y3 
36 


DAFMT 
DA31 
Y2 
32 


DBFMT 
DB31 
Yl 
32 


YSCAN 
Serial input is logic '0' 
YO 
32 


serial scan testing 
control 
signals 


Two internal 
control 
signals 
will be generated 
to initiate scan testing. 
The first signal, 
SCANZ, 
will be decoded 
from the state BPA4-BPAO 
is 1F hex and TP1-TPO 
is 0 hex. When SCANZ 
is active 
(low), all ten scan registers 
listed in Table 24 will be in a serial-in, 
serial-out 
mode. They may be simultaneously 
read out and reloaded 
with 
their expected 
contents 
by a series of 40 clocks. 
The serial input and serial output 
pin assignments 
are given 
in Table 24. 


The second 
signal, PROBEZ, 
will be decoded 
from the state BPA4-BPAO is 1E hex and TP1- TPO is 0 hex. When 
PROBEZ 
is active (low), seven ofthe ten registers 
listed in Table 24 are in a normal operating 
mode. The normal 


mode 
means 
that the registers 
are fully under 
control 
of their respective 
feed-through 
and enable 
pins. The 
V-SCAN 
register 
simply 
samples 
the data being routed to the pins Y31-YO. 


Only DAFMT, 
DBFMT, 
and RFADDR 
are placed 
in a hold mode and the inputs to the device 
normally 
coming 
from DA31-DAO 
and DB31-DBO 
are supplied 
instead 
by the corresponding 
bits stored 
in registers 
DAFMT 
and 
DBFMT 
respectively. 
The 
register 
file addresses 
normally 
supplied 
by way 
of pins WAS-WAO, 
WBS-WBO, 
RCS-RCO, 
RDS-RDO, 
RES-REO, 
and RFS-RFO are instead 
supplied 
by the corresponding 
bits stored 
in the 
RFADDR 
register. 


The 
PROBEZ 
control 
coupled 
with the SCANZ 
control 
allows 
for sequences 
of tests 
using only one pin for 
DA31-DAO, 
one pin for DB31-DBO, 
and one pin for the register 
file addresses. 
Another 
pin would 
be used for 
the output 
bus Y31-YO. This potentially 
reduces 
the probe 
pin count 
by 124 pins or to a total of 84 pins and 
provides 
access to the contents 
of all the major registers 
on the device as listed in Table 24. The sequence 
would 
be as follows: 


1. 
SCANZ 
active: 
load and read all registers 
using 40 clock sequence. 


2. 
PROBEZ 
active: 
clock once to advance 
data one pipeline 
position. 


3. 
SCANZ 
active: 
load and read all register 
using 40 clock sequence. 
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TIMING 
CLOADt 
IOL 
IOH 
VLOAD 


PARAMETERS 
(pF) 
(mA) 
(mA) 
(V) 


ten 
tPZH 
0 


tPZL 
50 
8 
-8 
3 


tdls 


tpHZ 
-8 
tpLZ 
50 
8 
1.5 


tpd 
50 
8 
-8 
* 
t CLOAD includes probes and test fixture capacitance. 
* VLOAD - VOL = 50 Q. where VOL = 0.45 V, IOL = 8 mA. 


IOL 


TIMING 
INPUT 


(See Note A) 


I+- tf 


VOLTAGE WAVEFORMS 
SETUP AND HOLD TIMES 
INPUT RISE AND FALL TIMES 


~ 


--3V 


1.5V 
1.5V 


I 
I 
OV 


~tw-+i 
I 
I 


~ 


I 
3V 


1.5 V 
1.5 V 
____ 
OV 


VOLTAGE WAVEFORMS 


PULSE DURATION 


OUTPUT 
~ 
- 
- 
- 
- 
3 V 
CONTROL 
1.5 V 


INPUT 
L 1.5 V 
~ 
1.5 V 
(low-level 
II 


(See Note A) -..IiI 
1\ 
0 V 
enabling) 
~ 
- 
- 
- 
- 
0 V 


'---- 
tpZL -.J!+: 
I 


tpd 
14 
.1 
~ 
tpd 
1 
I 
tPLZ~ 
1-- 


1 
1/ 
I 
1- - 
VOH 
1 
LI 


I~~~:~i 
1 T 
1.5V I 
~V 
WAVEFORM 
1 ---1-0~- - -I I::::-=-=-=- 
~~.5V 
--1-- 
I 
VOL 
(See Note B) 
VOL + 10~ 
1L':~L~ 
l~VOL 
1_., 
~ 
tpd 
1.1 


~~ 
1 
~~- 
~ 
I 
F 
VOH 
tpZH -+I 
I+-,. 
••• I~.- - 
VOH 
OUT.OFO·PuHTApSUET 
\: 
1.5 V 
1.5 V 
- 
- 
...J 
\. 
VOH-10too 
\VOH-10% 
'----,- 
- 
- 
VOL 
WAVEFORM 2 
'-.:.:.- 
-1.5 
V 
(See Note C) 
------------ 
OV 


NOTES: A. Phase relationships between waveforms were chosen arbitrarily. All input pulses are supplied by pulse generators having the following 
characteristics: 
PRR = 1 MHz. Zo = 50 Q, tr " 6 ns, tf" 
6 ns. 
B. Waveform 
1 is for an output with internal conditions such that the output is low except when disabled by the output control. 


C. Waveform 2 is for an output with internal conditions such that the output is high except when disabled by the output control. For tpLZ 
and tpHZ. VOL and VOH are measured values. 
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mUltiplier 
instruction 
set 


Multiplier operations are under control of instruction bits Mia-MIS. Bit a signifies integer or floating-point 
operations as follows: 


INSTRUCTION 
BITS 
MULTIPLIER 


M18 
M17-M15 
OPERATIONS 


000 
+(A - B) 


001 
-(A- 
B) 


010 
+(A -IBO 


ot 
011 
-(A -IBO 


100 
+(IAI-IBO 


101 
-(IAI-IB[) 


110 
(2-A 
- B) 


111 
(3 -A 
- B)/2 


000 
LS A - B 


001 
MSA 
- B 


010 
LS A - #B 
1l 
011 
MSA- 
#B 


100 
LS #A - #B 


101 
MSHA- 
#B 


110 
LS M(A) _ M(B) 


111 
MS M(A) - M(B) 
t Floating-Point 
Format 
llnteger 
Format 


NOTE: 
A pass A operation exists for the multiplier by forcing a '1'10 be input on the B input port by selecting SELM(3-2) 
= B'10'. The notation 


"#A" for integer operations means the input value is assumed to be unsigned. The notation "M(A)" means the input value is assumed to 
be unsigned but the MSB (floating-point 
hidden bit) is forced to a 1. 


TEXAs ." 
INSTRUMENTS 


The following 
text gives a more complete 
description 
of each of the multiplier 
opcodes 
and their operation. 
The 


general 
format 
used is: 


STATUS 
N: 
z: 


GOUT/U: 


OVER: 
STEX: 


Sign 
Zero 
Underflow 
Overflow 
Divide-by-Zero 


INSTRUCTION: 
+(A * B) OPCODE 
: MI = 0 hex 


DESCRIPTION: Multiply floating-point. 
The lower 32 bits of the mantissa 
are truncated 
after normalization. 
The 


sign of the product 
is the exclusive-OR 
combination 
of the sign of the operands. 
MNEMONIC: MF 
STATUS 


N: 
The sign of the product 


Z: 
Zero flag is set when either A or B is zero (I.e. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 
COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 
OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


INSTRUCTION: 
-(A * B) OPCODE 
: MI = 1 hex 


DESCRIPTION: 
Multiply 
floating-point 
Negate 
output. 
The 
lower 
32 bit of the mantissa 
is truncated 
after 


normalization. 
The sign of the product 
is the exclusive-NOR 
combination 
of the sign of the operands. 


MNEMONIC: 
MFN 


STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (I.e. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 


COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 
OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


INSTRUCTION: 
+(A * IBI) OPCODE 
: MI = 2 hex 


DESCRIPTION: 
Multiply 
floating-point 
using absolute 
value of B. The lower 32 bit of the mantissa 
is truncated 


after normalization. 
The sign of the product 
is that of the input in A port. 
MNEMONIC: MFAB 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (I.e. the exponent 
= 0 and the sign = 0). Sign and 


exponent 
of the product 
are set to zero. 
COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 


of the product 
are set to zero. 
. 


OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 
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INSTRUCTION: 
-(A * IBI) OPCODE 
: MI = 3 hex 
DESCRIPTION: 
Multiply 
floating-point 
using absolute 
value 
of B and Negate 
output. 
The lower 32 bit of the 
mantissa 
is truncated 
after normalization. 
The sign of the product 
is the inversion 
of that of the input in A port. 


MNEMONIC: MFABN 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when 
either A or B is zero (i.e. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 


COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 
OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


INSTRUCTION: 
+(IAI * IBI) OPCODE 
: MI = 4 hex 
DESCRIPTION: 
Multiply 
floating-point 
using absolute 
value 
of A and B. The lower 32 bit of the mantissa 
is 
truncated 
after normalization. 
The sign of the product 
is O. 


MNEMONIC: MFABS 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (i.e. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 


COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 
OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


INSTRUCTION: 
-(IAI * IBI) OPCODE 
: MI = 5 hex 
DESCRIPTION: 
Multiply 
floating-point 
using absolute 
value of A and B Negate 
output. 
The lower 32 bit of the 
mantissa 
is truncated 
after normalization. 
The sign of the product 
is 1. 


MNEMONIC: MFABSN 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (i.e. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 


COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 


OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 
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INSTRUCTION: (2 - A * B) OPCODE : MI = 6 hex 
DESCRIPTION: 
Two Minus A Times 
B. The output 
is equal to 2 minus the product 
of A and B with the lower 
32 bit of the mantissa 
truncated 
after normalization. 
This is the special 
instruction 
used in Newton-Raphson 


approximation 
of 1/B. The product 
A * B must be approximately 
1 for this instruction 
to execute 
properly 
since 


the normalization 
circuit for the subtract 
operation 
is based on this assumption 
and cannot 
normalize 
the result 


for unrestricted 
values 
of A and B. 
MNEMONIC: NRDIV 
STATUS 


N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (Le. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 
COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 


OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


STEX: 
Divide by Zero. This status 
bit is set when A or B are zero (Le. exponent 
is zero). This will be 


the case when the exponent 
of B is 0 hex or 1 hex. The first case is a divide 
by zero and the 
second 
causes 
an intermediate 
overflow 
for which the seed rom sends out a zero exponent 
on 
A, which 
sets this flag. 


INSTRUCTION: (3 - A * B)/2 OPCODE : MI = 7 hex 
DESCRIPTION: 
One Half of Three 
Minus A Times 
B. The output 
is equal one half of 3 minus the product 
of A 


and B with the lower 32 bit of the mantissa 
truncated 
after normalization. 
This is the special 
instruction 
used in 


Newton-Raphson 
approximation 
of 1/Square 
Root of B. The product 
A * B must be approximately 
1 for this 


instruction 
to execute 
properly 
since 
the 
normalization 
circuit 
for the 
subtract 
operation 
is based 
on this 


assumption 
and cannot 
normalize 
the result for unrestricted 
values 
of A and B. 


MNEMONIC: NRSQR 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (Le. the exponent 
= 0 and the sign = 0). Sign and 
exponent 
of the product 
are set to zero. 
COUT/U: 
Underflow 
flag is set when the exponent 
of the product 
is zero or negative. 
Sign and exponent 
of the product 
are set to zero. 
OVER: 
Overflow 
flag is set when the exponent 
of the product 
is larger than FF hex. 


STEX: 
Divide by Zero. This status 
bit is set when A or B are zero (Le. exponent 
is zero). This will be 


the case when the exponent 
of B is 0 hex or 1 hex. The first case is a divide-by-zero 
and the 


second 
causes 
an intermediate 
overflow 
for which the seed rom sends out a zero exponent 
on 
A, which 
sets this flag. 


INSTRUCTION: LS A * B OPCODE : MI = 8 hex 
DESCRIPTION: 
Multiply 
Integer Single precision. 
The output is the least significant 
32 bits of the 64 bit product 


of A and B. A and B are assumed 
to be signed 
integer 
and so is the result. The exponent 
is unpredictable. 
MNEMONIC: MIS 
STATUS 
N: 
Z: 


COUT/U: 


OVER: 
STEX: 


The sign of the product. 
Zero flag is set when either A or B is zero (Le. A31-AO=0 
or B31-BO=0). 


Underflow 
flag is set to '0'. 


Overflow 
flag is set when 
bits 66 through 
31 of the product 
is neither 
all '0' nor all '1'. 


Divide-by-Zero 
flag is set to '0'. 
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INSTRUCTION: 
MS A * B OPCODE 
: MI = 9 hex 
DESCRIPTION: 
Multiply Integer Double precision. 
The output is the most significant 
32 bits of the 64-bit product 
of A and B. A and B are assumed 
to be signed 
integers 
and so is the result. The exponent 
is unpredictable. 


MNEMONIC: 
MID 
STATUS 
N: 
The sign of the product. 


z: 
Zero flag is set when either A or B is zero (Le. A31-AO=0 
or B31-80=0). 
COUT/U: 
Overflow 
flag is set to '0'. 
OVER: 
Underflow 
flag is set to '0'. 
STEX: 
Divide-by-Zero 
flag is set to '0' 


INSTRUCTION: 
LS A * #B OPCODE 
: Ml = A hex 
DESCRIPTION: 
Multiply 
Integer 
Single 
precision 
with B Unsigned. 
The output 
is the least significant 
32 bits 
of the 64-bit product of A and B. A is assumed 
to be signed integer and B is assumed 
to be an unsigned 
number. 


The result is a signed 
integer. 
The exponent 
is unpredictable. 
MNEMONIC: 
MISBU 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (Le. A31-AO=0 
or B31-80=0). 


COUT/U: 
Overflow 
flag is set when bits 66 through 
31 of the product 
is neither 
all '0' nor all '1'. 
OVER: 
Underflow 
flag is set to '0'. 


STEX: 
Divide-by-Zero 
flag is set to '0'. 


INSTRUCTION: 
MS A * #B OPCODE 
: MI = B hex 
DESCRIPTION: 
Multiply 
Integer 
Double 
precision 
with B Unsigned. 
The output 
is the most significant 
32 bits 
of the 64-bit product 
of A and B. A is assumed 
to be signed integer and B is assumed 
to be an unsigned 
number. 
The result is a signed 
integer. The exponent 
is unpredictable. 
MNEMONIC: 
MIDBU 
STATUS 
N: 
The sign of the product. 


Z: 
Zero flag is set when either A or B is zero (Le. A31-AO=0 
or B31-BO=0). 


COUT/U: 
Underflow 
flag is set to '0'. 


OVER: 
Overflow 
flag is set to '0'. 


STEX: 
Divide-by-Zero 
flag is set to '0'. 


INSTRUCTION: 
LS #A * #B OPCODE 
: MI = C hex 
DESCRIPTION: 
Multiply 
Integer Single precision 
with A and B Unsigned. 
The output 
is the least significant 
32 
bits of the 64-bit 
product 
of A and B. A and B are assumed 
to be unsigned 
integers 
and so is the result. 
The 
exponent 
is unpredictable. 
MNEMONIC: 
MISU 
STATUS 
N: 
The sign of the product, 
is zero. 
Z: 
Zero flag is set when either A or B is zero (Le. A31-AO=0 
or B31-BO=0). 
COUT/U: 
Underflow 
flag is set to '0'. 
OVER: 
Overflow 
flag is set when 
bits 66 through 
32 of the product 
is neither 
all '0' nor all '1'. 


STEX: 
Divide-by-Zero 
flag is set to '0'. 
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INSTRUCTION: MS #A * #B OPCODE : MI = 0 hex 
DESCRIPTION: 
Multiply 
Integer 
Double 
precision 
with A and 8 Unsigned. 
The output 
is the most significant 
32 bits of the 64-bit product 
of A and 8. A and 8 are assumed 
to be unsigned 
integers 
and so is the result. The 
exponent 
is unpredictable. 


MNEMONIC: MIDU 
STATUS 
N: 
The sign of the product, 
is zero. 
Z: 
Zero flag is set when either A or 8 is zero (Le. A31-AO=0 
or 831-80=0). 


COUT/U: 
Underflow 
flag is set to '0'. 
OVER: 
Overflow 
flag is set to '0'. 


STEX: 
Divide-by-Zero 
flag is set to '0'. 


INSTRUCTION: LS M(A) * M(B) OPCODE : MI = E hex 
DESCRIPTION: 
Multiply 
Integer 
Single 
precision 
with A and 8 interpreted 
as floating-point 
mantissas. 
The 
output is the least significant 
32 bits of the 64-bit product 
of A and 8. A and 8 are assumed 
to be mantissas 
from 
floating-point 
numbers 
in the internal 
40-bit floating-point 
format. 
The hidden 
bit is inserted 
and the sign bit is 
ignored 
by this instruction. 
No normalization 
is performed. 
The exponent 
is unpredictable. 


MNEMONIC: MISM 
STATUS 
N: 
The sign of the prOduct, is zero. 


Z: 
Zero flag is set when either A or 8 is zero (Le. A31-AO=0 
or 831-80=0 
before the hidden 
bit is 
inserted 
into bit 31). 
COUT/U: 
Underflow 
flag is set to '0'. 


OVER: 
Overflow 
flag is set to '0'. 
STEX: 
Divide-by-Zero 
flag is set to '0'. 


INSTRUCTION: MS M(A) * M(B) OPCODE : MI = F hex 
DESCRIPTION: 
Multiply 
Integer 
Double 
precision 
with A and 8 interpreted 
as floating-point 
mantissas. 
The 
output is the most significant 
32 bits of the 64-bit product 
of A and 8. A and 8 are assumed 
to be mantissas 
from 
floating-point 
numbers 
in the internal 
40-bit floating-point 
format. 
The hidden 
bit is inserted 
and the sign bit is 
ignored 
by this instruction. 
No normalization 
is performed. 
The exponent 
is unpredictable. 


MNEMONIC: MIDM 
STATUS 


COUT/U: 


OVER: 
STEX: 


N: 
The sign of the product, 
is zero. 
Z: 
Zero flag is set when either A or 8 is zero (Le. A31-AO=0 
or 831-80=0 
before the hidden 
bit is 
inserted 
into bit 31). 


Underflow 
flag is set to '0'. 
Overflow 
flag is set to '0'. 


Divide-by-Zero 
flag is set to '0'. 
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Newton-Raphson 
2 - A * B 


The 
multipier 
instruction 
NRDIV 
performs 
the 
first 
half 
of 
Newton-Raphson 
division 
cycles. 
A cycle 
of 
Newton-Raphson 
division 
consists 
of evaluating 
the equation 
X * (2 - B * X) where 
B is the divisor 
and X is a 
close approximation 
of 1/B. This is performed 
in two steps: 


1. 
2 - OPA * OPB : where 
OPA = X and OPB = B 


2. 
OPA * OPB : where 
OPA = X and OPB = the result of step 1 


Step two is just a standard 
floating-point 
multiplication 
and will not be discussed 
further. 
Step one is the special 
NRDIV 
instruction 
that requires 
further 
description 
to define its operation 
and to understand 
fUlly its correct 
use 
and results. 
These 
steps are repeated 
until a quotient 
of sufficient 
accuracy 
is obtained. 


The NRDIV 
instruction 
can be used to perform 
the floating-point 
division 
of NB for all possible 
values 
of A and 
B with certain restrictions 
such as overflow 
cases. This instruction 
will not function 
correctly 
for all possible values 
of input operands 
OPA and OPB. More specifically, 
the subtraction 
of the product OPA * OPB from the quantity 
2. 


This 
is acceptable 
since 
the desired 
operation 
is to perform 
the division 
of NB and the restrictions 
on this 
individual 
instruction 
do not restrict 
the allowable 
values 
of A and B. The restrictions 
for the NRDIV 
instruction 
are added to simplify 
its hardware 
implementation. 
To allow a general 
2 - A * B instruction 
would have required 
the addition 
of a complete 
normalization 
stage to the multiplier, 
thereby 
slowing 
its operation 
and increasing 
its 
die size. 


The 
restriction 
on the operands 
to the 
NRDIV 
instruction 
is that 
one 
operand 
must 
be approximately 
the 
reciprocal 
of the other 
operand. 
It does not matter 
which 
operand 
is the smaller 
operand, 
only that they are 
reciprocals 
of each other. This restricts 
the result of their product 
to be approximately 
1. If the on device 
seed 
(see Table 6) is used to generate 
a seed value for an approximate 
reciprocal 
of the divisor, then this condition 
will be fulfilled 
for all legal values 
of the divisor 
(values with a biased exponent 
of 0 or 1 are not legal values 
and 
will cause an overflow 
condition. 
The accuracy 
of the seed generated 
is approximately 
8 bits and will produce 
a product 
that is more than sufficiently 
close to 1 to allow 
proper 
operation 
of this instruction. 
The minimum 
accuracy 
required 
for the reciprocal 
seed to function 
properly 
is approximately 
2 significant 
binary 
bits. Seeds 
generated 
by other methods 
must at least meet this requirement 
for the NRDIV 
instruction 
to operate 
properly. 


The need for this restriction 
is not in the multiplication 
part but in the subtraction 
part of the instruction. 
The 
multiplication 
of any operands 
A and B will require a mantissa 
normalization 
of at most one bit. Therefore, 
most 
multipliers 
do not contain 
normalization 
circuitry 
that will handle 
more than this one bit shift of the mantissa. 
A 
subtraction 
of any general 
operand 
A and B may require 
multiple 
bit shifts of the mantissa 
both to align the radix 
points 
initially 
and normalize 
the result 
afterwards. 
With the restriction 
on the input operands 
as above 
and 
knowing 
that the subtraction 
will always 
be from the fixed constant 
2, the operation 
is constrained 
to require 
no 
more than a one bit shift of the mantissa 
result. The result of the product 
will be approximately 
1, either slightly 
above 
or slightly 
below, 
and the subtraction 
from 
2 will result 
in a quantity 
slightly 
below 
or slightly 
above 
1, 


respectively. 
Normalization 
will require 
at most a one-bit 
shift as is already 
available 
in the multiplier. 


The implementation 
of the actual subtraction 
takes advantage 
of one more observation 
that requires 
no further 
restrictions 
on the operands 
but allows futher 
simplification 
of the hardware. 
The subtraction 
implementation 
eliminates 
the need for a hard-coded 
constant 
2 by observing 
that a 2s complement 
on the mantissa 
performs 
exactly 
the same operation. 
This is evident 
after examination 
of appropriate 
bit strings. 
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double-precision 
multiplication 
and addition 


Double-precision (64-bit) operations are supported in the multiplier through eight fixed-point instructions that 
allow output of the upper or lower half of a 64-bit result. The ALU supports double-precision through instructions 
that allow multiple-precision fixed-point addition and 2s complement operations with carries propagated though 
an internal carry flip flop. The conversion and shift operations of the ALU are primarily single precision (32-bit) 
but also have application in double-precision functions. Floating-point double-precision operations may be done 
by decomposition into these single-precision fixed-point operations. Status will need to be selected from the 
results of the individual single-precision operations when available or specially generated by extra microcode 
steps. The select function of the ALU will aid in developing branchless microcode to handle exceptions. 


Double-precision addition and subtraction of fixed-point values is done with the use of a carry flip-flop (CFF). 
The instructions that use this carry are: 


1. 
A + B + CFF A1 = 09 hex 


2. 
A + B+CFF 
AI = OBhex 


3. 
A + CFF AI = 11 hex 


A double-precision or multiple-precision addition operation begins with a normal add with carryin such as 
A + B + CIN on the lowest order 32-bit section of the operands. The next higher 32-bit sections are then added 
with instruction 1 above. Subsequent sections are also added using instruction 1. Instruction 2 would be used 
for subtraction and instruction 3 for one or 2s complement. The carry flip-flop is set by all of the integer arithmetic 
operations. The low-order addition must be performed immediately before the high-order addition in order to 
propagate the carry properly. 


Double-precision multiplication is a composite of multiplication and addition operations. The double-precision 
operands must be stored separately as high and low values in the register file. The 32-bit integer field of the 
register holds the data. The exponent field is unused. A partial product algorithm isthen used to multiply the high 
and low operand values and sum the outputs. For unsigned operands, this process is quite standard and does 
not need to be discussed further. Status must be determined by extra microcode steps as appropriate for the 
needs of the user's algorithm. 


For signed double-precision multiplication, each operand may be treated by the multiplier as an unsigned 
fixed-point value, then correction cycles in the ALU would be used to correct the result. This method of signed 
multiplication that will implicity extend the values by the sign is outlined as follow: 


Double-precision integer: 


A = ISI· 
63 bits. . . . . .. 
1 = 2+63 + I 
63 bits 
·1 


<--2s 
Complement--> 
<---Unsigned---> 


Implicit sign extended double-precision integer: 


A = 1SI· 
63 bits 
I = - 2+64 + I 
63 bits 
·1 


<--2s 
Complement--> 
(not stored) <---Unsigned,---> 
(S bit not stored) 
= - 2+64 + 2+63 + I 
63 bits 
·1 
= - 2+63 + I 
63 bits 
·1 


If A and Bare 2s complement 64-bit integers and UA and UB are the unsigned interpretation of these numbers 
as shown above, then the operations to be done in double precision are: 


A * B= (- 2+64 + UA) * (- 2+64 + UB) 


= (UA * UB) -A - B 
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Each portion of each implicitly unsigned operand is multiplied by both portions of the other operand. The high- 
and low-order results of each multiplication are stored in the register file. Each portion is then added to 
corresponding pieces of the same magnitude in the ALU to produce a 128-bit result. This is essentially the 
unsigned double-precision function mentioned previously. Then the implicit sign of A is multiplied by B (high and 
low portions) and subtracted from the result by a conditional subtract operation. The implicit sign of B is also 
mUltiplied by A and subtracted to complete the double-precision multiply. The steps involved are shown below 
(AH,AL,BH,BL stand for A: high-order part, A: low-order part, B: high-order part, and B: low-order part, 
respectively) : 


(unsigned integer) 131 
01 


<----A,L---->, 


(unsigned integer) 131 
01 


x <----B,L----> 


163 
32131 
01 


<----·P1 
H,----> 
<----P1 
L----> 


Now the partial product results must be summed with other partial products of the same magnitude: 


Step 5: 


163 
32131 
01 
<---P1 H---> <---P 
1L---> 
163 
32131 
01 
<---P2H---><---P2L---> 
163 
32131 
01 
<---P3H---><---P3L---> 
163 
32131 
01 
+ <---P4H---><---P4L---> 


1128 
96195 
64163 
32131 
01 


<--P&--><---P7--><--P6--><--P5--> 
163 
32131 
01 
( 
- 
<---AH--><---AL--> 
If sign(B) is negative) 


1128 
96195 
64163 
32131 
01 


<--P1 
(}--><--P9-><--P6--><-' 
--P5--> 


163 
32131 
01 


( 
- 
<---BH--><--BL--> 
If sign(A) is negative) 


1128 
96195 
64163 
32131 
01 


<--P12---><---P11--><--P6--><---P5---> 


The conditional correction steps could be performed by branches in microcode or by using the select instruction 
of the ALU to select the adjusted or unadjusted quantity. The selection could be performed by shifting the sign 
bit of A or B into the LFF and feeding this status output back to the CIN pin to control the selection. 
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If the input operands are interpreted as integers, then the binary point of the result is at the extreme right. The 
binary point in the final result would be located between bits 127 and 126 for fractional operands. Other 
interpretations of the input operands would result in different interpretations of the result and status as 
appropriate. A desired section of the 128-bit string could be selected through use of the shift and logical 
operations in the ALU if desired. 


As mentioned, status for the double-precision operations will depend on the interpretation of the operands and 
may require extra steps to check the results. In general, status from each individual step will not be very useful, 
but status generated by the final multiply or addition steps may provide part of the needed information. For 
example, overflow when from the multiplication of the low-order pieces in double-precision does not necessarily 
indicate the entire operation will overflow. 


This description provides the outline for only one possible method to perform double-precision operations. 
Depending on the constraints of the algorithm, other variations may be useful and faster. For floating-point 
operations the exponent extract and conversion instructions may prove useful in developing relatively efficient 
double-precision algorithms. In all cases, the number of cycles for double-precision operations will be many 
more times than needed for single-precision operations, which are the intended primary usage of the device. 


double-precision 
Newton-Raphson 
division 


Support for Newton-Raphson 
division 
in double-precision 
is provided through 
use of multiple cycle 
double-precision mUltiplication and addition operations outlined above. The seed used for double-precision is 
typically the result of a single-precision floating-point division. The conversion instructions of the ALU will allow 
the single-precision result to be converted to different formats or extracted into a multiple word dOUble-precision 
format as appropriate for the algorithm being performed. 


One additional Newton-Raphson double-precision cycle will be needed to generate a double-precision result 
for most needs. This process may be done using the methods for double-precision multiplication and addition. 
Other cycles may be needed to perform correction operations or format the results depending on the 
requirements of the application and the algorithm used. 


For certain graphics applications, a division of two single-precision quantities producing a double-precision 
value of at least 50-bit accuracy is required. The output will be fixed-point format with 32 binary digits to the left 
of the radix point and 32 to the right. The left 32 bits are output as one word and the right as another word. The 
algorithm used is essentially the standard partial products method but the number of cycles has been reduced 
by taking advantage of the special constraints of the application. The steps used are listed in Table 25, which 
includes a description of the partial products simplification actually used and other notes. 


The general assumptions made for this algorithm are: 


Accuracy to 64 bits is not necessary, 60 bits is sufficient. This allows the partial product procedure to be 
simplified. 


A and B are positive. All the input signs are ignored. 


Output is to be unsigned 32.32, total 64-bit result, 32 bits on either side of the radix point. 


Binary exponent of input is -28 to +31. 


Status can be obtained from the single-precision steps or will be generated separately. 
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REGISTER 
VARIABLE 
REGISTER 
VARIABLE 


RFOO 
A 
RF08 
PH 


RFOl 
B 
RF09 
SH1, SHF 


RF02 
XO, Xl, X2 
RFOA 
SH2, SHM 


RF03 
CL, SPL 
RFOB 
- 


RF04 
CH, SPH 
RFOC 
255 


RF05 
DL 
RFOD 
- 


RF06 
DH 
RFOE 
1 


RF07 
PL 
RFOF 
0 


Partial 
Products: 


The double-precision 
cycle completes 
the equation: 


A x X2 x (2-(8 x X2)) 


This 
requires 
two 
32-bit 
by 32-bit 
multiplications, 
one 
64-bit 
2s complement, 
and 
one 
64-bit 
by 64-bit 
multiplication 
as follows: 


A.X2 
Steps: 
M8,M9 - 
A and X2 are the 32-bit mantissas 
from the single-precision 
NR division. 
These 
steps 
insert 
the hidden 
bit and multiply 
saving 
the high and low 32 bits of the result. The result is not normalized 
nor is the 
leading 
1 made into a hidden 
bit. 


B.X2 
Steps: 
M6,M7 
- 
8 and X2 are the 32-bit 
mantissas 
from the single-precision 
NR division. 
These 
steps 
insert 
the hidden 
bit and multiply 
saving the high and low 32 bits of the result. The result is not normalized 
nor is the 
leading 
1 made into a hidden 
bit. 


2-B.X2 
Steps: A7,A8 - 
The 2 minus step is accomplished 
by 2s complementing 
the 64-bit 8. 
X2 product. 
This can be 
done since 8 • X2 is close to 1. 


A • X2 • (2 - (B • X2» 
Steps: 
M10,M11,M12,M13 
multiply 
the 
high 
and 
low 32-bit 
pieces 
of the two 64-bit 
quantities 
A.X2 
and 
2 - (8 • X2) to produce 
the high 64 bits of the 128-bit result. The low 64 bits of the 128-bit result are not needed 


and missing 
carry outs will not affect the required 
accuracy 
in the high 64 bits. Again, 
an accuracy 
of 60 bits is 
sufficient 
for this application. 


Steps: A 12,A 13,A 14,A 15 add the pieces 
of the partial 
product 
that contribute 
to the high 64 bits of the result. 
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DD 
GD 


+ 
aD 


+ B B .- 
NOT COMPUTED 
-. 


I A. X2 • (2 - (B .)(2» I 
• 
I 
RADIX POINT 


The tesult 
is an un normalized 
64-bit fixed-point 
quantity 
with the radix point to the extreme 
left. Formatting 
for 
output consists 
of shifting 
this quantity 
to the right according 
to the shift quantity 
computed 
from the exponents 
of A and X2 in steps A4, A5, A6, A9, and A 10. The actual 
shifting 
is performed 
in the multiplier 
by converting 
the shift count to a power of 2. Steps 
M15, M16, M17, A 16, A 17, A 18, and A 19 perform 
the shift needed. 


double-precision 
Newton-Raphson division cycle support 


SINGLE(A)/SINGLE(B) 
= DOUBLE(Q) 


Cycle 1: Load A and B 
MULT: idle 
ALU: idle 


A is loaded 
through 
DA and stored 
in RFOO; B is loaded 
through 
DB and stored 
in RF01. Add one more cycle 
if A and B must be coverted 
from fixed-point 
to floating-point 
format. Add another 
cycle, if A and B will be loaded 
only through 
a single 
port. 


single-precision 
mode 


Cycle 2: NR single-precision 
division 2 - XO* B 
MULT: NRDIV(SEED,RF01) 
=> MULT B 
ALU: idle 


First half of initial NR cycle. Load constant 
value 255 into the register file. The seed value is generated 
by applying 
RF01 to the seed (see Table 6). 


Cycle 3: NR single-precision 
division XO(2 - XO* B) => X1 
MULT: SEED*MULT 
B => MULT B,RF02 
ALU: idle 


Second 
half of initial NR cycle. 
Load constant 
values 
1 and O. 
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Cycle 4: NR single-precision 
division 
2 - B * X1 


MULT: NRDIV(RF01 
,MULT B) => MULT B 


ALU: E21(A) => ALU B 


First half of second 
NR cycle. 
Extract 
exponent 
of A for later shift operation. 


Cycle 5: NR single-precision 
division 
X1 (2 - B * X1) => X2 
Add EXP(A)-255 
=> SH1 


MULT: RF02*MULT 
B => MULT B,ALU A,RF02 


ALU: -255+ALU 
B => RF09 


Second 
half of second 
NR cycle. Result from MULT is accurate 
to approximately 
31 bits. Constant 
is subtracted 
from the exponent 
to convert 
it to a shift quantity. 


double-precision 
cycle 


Cycle 6: Low half of mantissa product 
B * X2 
EXP(X2) => SH2 


MULT: MISM(RF01 
,MULT B) => ALU A 
ALU: 
E21(ALU A) => RFOA 


Insert the hidden bit and multiply the mantissas 
of Band X2 selecting 
the low-order 
32 bits. Extract the exponent 
of X2 for later shift operation. 


Cycle 7: High half of mantissa product 
B * X2 
Complement 
low half of B * X2 => CL 


MULT: MIDM(RF01,RF02) 
=> ALU A 


ALU: 
NOT(ALU 
A)+CIN 
=> RF03 


Insert the hidden 
bit and multiply 
the mantissas 
of Band 
X2 selecting 
the high-order 
32 bits. Complement 
low 
half of B*X2, CIN=1. 


Cycle 8: Low half of mantissa product A * X2 => DL 
Complement 
high half of B * X2 => CH 
MULT: MISM(RFOO,RF02) 
=> RF05 


ALU: NOT(ALU 
A)+CFF 
=> RF04 


Insert the hidden 
bit and multiply 
the mantissas 
of A and X2 selecting 
the low-order 
32 bits. Complement 
high 
half of B*X2. 


Cycle 9: High half of mantissa product A * X2 - DH 
Add EXP(A) + EXP(X2) - 255 => SHF 
MULT: MIDM(RFOO,RF02) 
=> RF06 


ALU: RF09+RFOA 
=> ALU B,RF09 


Insert the hidden 
bit and mUltiply the mantissas 
of A and X2 selecting 
the high-order 
32 bits. Finish 
adding 
exponents 
and constant 
for shift value 
(SH1 + SH2 - 255). 


Cycle 10: High piece of partial product 
CH * DL => PL1 
Shift a 1 left by SHF => SHM 


MULT: HIGH(RF04 
* RF05) 
=> RF07 
AlU: 
SLL(1, 
ALU B) => RFOA 


Multiply 
the partial product 
to get first piece of the low half. Convert 
the shift factor 
(SHF) to a power of 2 to use 
the multiplier 
for shifting. 
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Cycle 11: High piece of partial product 
CL * DH => PL2 
MULT: HIGH(RF03 * RF06) => ALU A 
ALU: idle 


Multiply the partial product to get second piece of the low half. 


Cycle 12: High piece of partial product 
CH * DH => PH 
Add low pieces PL2 + PL1 => PL 
MULT: HIGH(RF04 * RF06) =<> ALU A 
ALU: ALU A + RF07 =<> RF07 


Multiply the partial product to get the only piece of the high half. Add the first two low pieces of the partial product 
with a carry out. 


Cycle 13: Low piece of partial product CH * DH => PL3 
Add high piece and carry PH + CFF => PH 
MULT: LOW(RF04 * RF06) => ALU A 
ALU: ALU A + CFF => RF08 


Multiply the partial product to get last piece of the low half. Add the carry out from the low piece to the high piece. 


Cycle 14: Add last low piece PL3 + PL =<> PL 
MULT: idle 
ALU: ALU A + RF07 => MULT A, RF07 


Complete the addition of the three low pieces of the partial product with a carry out. 


Cycle 15: Shift low partial product 
PL * SHM => SPL 
Add high piece and carry PH + CFF => PH 
MULT: HIGH(MULT A * RFOA) => RF03 
ALU: RF08 + CFF => MULT A,RF08 


Use the multiplier as a simple shifter to shift the low result of the partial product for output formatting. Add the 
carry out to the high result of the partial product. 


Cycle 16: Shift high partial product 
PH * SHM => SPM 
Rotate left the shift factor ROL(SHF) => LFF 
MULT: LOW(MULT A * RFOA)=> ALU A 
ALU: ROL(RF09) => LFF 


Multiplier shift of high partial product to obtain the fill bits for the shifted low partial product. Rotate the shift value 
left in the ALU to set the LFF.This clears the LFF, if the shift value was positive or zero; and sets the LFF, if it 
was negative. 


Cycle 17: Shift high partial product 
PH * SHM => SPH 
Combine low piece and fill bits OR(SPM, SPL) => SPL 
MULT: HIGH(RF08 * RFOA)=> ALU A, RF04 
ALU: OR(ALU A, RF03) => RF03 


Multiplier shift the high partial product to obtain the high result. OR the low result with the fill bits to obtain the 
low result. The OR operation does not destroy the setting of the LFF. 
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Cycle 18: Select high result for output SEL(SPH, 0) = Y 
MULT: idle 


ALU: SEL(ALU 
A, RFOF) = SUM 


Select the result for the high half of the result (integer part) based on the setting of the LFF. The shift value being 
negative 
indicates 
the result must be shifted 
an extra 32 bits to the right so the high result should 
be all zeroes. 


Cycle 19: Select low result for output SEL(SPL, SPH) = Y 
Output high result on Y 
MULT: idle 


ALU: SEL(RF03, 
RF04) = SUM 


Select the result for the low half of the result (fraction 
part) based on the setting of the LFF. The shift value being 
negative 
indicates 
the result must be shifted 
an extra 32 bits to the right, so the low result should 
be the high 
half. The high result will now be available 
at the Y port. 


Cycle 20: Output high result on Y 
MULT: idle 
ALU: idle 


The low result will now be available 
at the Y port. 
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ALU operations are under control of instruction bits AI4-AIO. Bits 3 and 4 signify integer or floating-point 
operations as described in Table 27. 


INSTRUCTION 
BITS 
ALU OPERATIONS 
A14·A13 
A12-A10 


000 
A+B 


001 
A-B 


010 
Compare A, B (using A - B) 


011 
Pass A 
oot 
100 
Pass (-A) 


101 
Floating-Point 
to Integer Conversion 


110 
Floating-Point 
to Fraction Conversion 


111 
Integer to Exponent Conversion 


000 
A + B + CIN 


001 
A + B + CFF; 


010 
A + B + CIN 


011 
A + B + CFF; 


01§ 
100 
A + B + CIN 


101 
Pass NB on CIN 


110 
MIN/MAX on CIN 


111 
A+CIN 


000 
A+CIN 


001 
A + CFF; 


010 
AAND 
B 


011 
AAND 
B 


10§ 
100 
AXOR 
B 


101 
AORB 


110 
ANAND 
B 


111 
A NOR B 
t Floating-point format 
; The carry flip-flop CFF relains the carry-out status COUT of the previous ALU 
result for one cycle. 
§ Integer/Logical 
format 
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INSTRUCTION 
BITS 


A14·A13 
A12·A10 
ALU OPERATIONS 


000 
Shift Right Logical A (clear L Flip-Flop)t 


001 
Shift Right Arthimetic A 


010 
Rotate Right B~ 


011 
Rotate Left B~ 


10§ 
100 
Shift Left Logical A (set L Flip-Flop)t 


101 
Exponent to integer comversion 


110 
Integer to floating-point 
conversion 


111 
Fraclionallo 
floating-point 
conversion 


t The carry flip-flop CFF retains the carry-out 
status COUT of the previous ALU 
result for one cycle. 
* SRLJSLL - 
Barrel shift capability (shift count at B port). 
§ Integer/Logical 
format 
~ ROR/ROL - 
Rotate left or rotate right by one bit. The bit lost is stored in the link 
fkip-flop. The bit gained is output from the link flip-flop (see Figure 12). 


C:~~~L~F~L~IP~._F 
__L_Op~~~~r ~_I_S_H_IFT_R_E_G_I_ST_E_R_1 


The following 
text gives 
a more 
complete 
description 
of each 
of the ALU opcodes 
and their operation. 
The 


general 
format 
used is: 


STATUS 
N: 
z: 


COUT/U: 


OVER: 
STEX: 


Sign 
Zero 
Underflow 
Overflow 
Divide-by-Zero 


INSTRUCTION: 
A + B OPCODE 
: AI = 0 hex 


DESCRIPTION: Addition 
of floating-point 
operands. 
The result is a 31-bit mantissa 
(not rounded), 
sign bit, and 


an 8-bit exponent 
in Internal 
single 
precision 
format. 


MNEMONIC: ADDF 
STATUS 
Sign of the result. 
Set when the result is zero. 
Underflow 
occurs 
when, 
in order to normalize 
the mantissa 
after the addition 
operation, 
the 
biased exponent 
is less than 1. The result output from the ALU on underflow 
is a floating-point 


zero. 
Overflow 
occurs 
when, 
in order to normalize 
the mantissa 
after the addition 
operation, 
the 
biased 
exponent 
exceeds 
255 decimal. 
During an overflOW, the resulting 
output from the ALU 


has the correct 
mantissa 
and sign. However, 
the exponent 
wraps 
around. 


N:z: 


COUT/U: 
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INSTRUCTION: A· B OPCODE : AI = 1 hex 
DESCRIPTION: 
Subtraction 
of floating-point 
operands. 
The result is a 31-bit mantissa 
(not rounded), 
sign bit, 
and an a-bit exponent 
in internal 
single 
precision 
format. 


MNEMONIC: SUBF 
STATUS 
Sign of the result. 
Set when the result is zero. 
Underflow 
occurs when in order to normalize 
the mantissa 
after the subtraction 
operation 
the 
biased exponent 
is less than 1. The result output from the ALU during an underflow 
is a floating 
point zero. 
Overflow 
occurs 
when 
in order to normalize 
the mantissa 
after the subtraction 
operation 
the 
biased 
exponent 
exceeds 
255. The result output from the ALU during 
an overflow 
has the 
correct 
mantissa 
and sign. However, 
the exponent 
wraps 
around. 


INSTRUCTION: COMPARE A,B OPCODE : AI = 2 hex 
DESCRIPTION: 
Compares 
two floating-point 
operands 
and generates 
two status 
bits (N,Z). The operation 
A - B is used. The status can be evaluated 
identically 
to status from an integer subtraction 
to determine 
A GT B, 
AGE 
B, A EO B, A LT B, and A LE B. The result 
output 
from the ALU is that of a floating-point 
subtraction 
operation. 
This instruction 
varies from the subtraction 
operation 
only in status generated 
(see Tables 
23 and 24). 
MNEMONIC: CMP 
STATUS 


N: 
z: 


COUT/U: 


N: 
z: 
COUT/U: 


OVER: 
STEX: 


EXAMPLE: 


see floating-point 
compare 
section 
see floating-point 
compare 
section 
o 
o 


RAA = H'01 F7654321, 
RAB = H'01 F7654321' 
Result = H'xxxxxxxxxx' 
Status = B'0100' 


RAA = H'01 F7654321' 
RAB = H'0177654321' 
Result = H'xxxxxxxxxx' 
Status = B'1 000' 


INSTRUCTION: PASS A OPCODE : AI = 3 hex 
DESCRIPTION: Pass floating-point 
data on RAA input to the sum bus. The operand 
on the RAB input is ignored 
by this instruction. 
MNEMONIC: PA 
STATUS 
N: 
z: 


COUT/U: 
OVER: 


Sign of A. 


Set when A is zero. 
o 
o 
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INSTRUCTION: PASS (-A) OPCODE : AI = 4 hex 
DESCRIPTION: 
Pass floating-point 
data on RAA input to the sum bus while inverting 
the sign bit. The operand 
on the RAB input is ignored 
by this instruction. 
MNEMONIG;· 
PNA 
STATUS 
N: 
Inverted 
sign of A. 
Z: 
Set when A is zero. 
COUT/U: 
0 
OVER: 
0 


INSTRUCTION: FLOAT TO INTEGER CONVERSION OPCODE : AI = 5 hex 
DESCRIPTION: 
Converts 
a floating-point 
internal 
F format 
number 
on the RAA port to a signed 
integer 
with 
no rounding. 
The operand 
on the RAB port is ignored 
by this instruction. 
The resulting 
integer 
has the eight 
exponent 
bits set to O. 
MNEMONIC: 
F21 
STATUS 
Sign of the result. 
Set when the result is zero. 
O. Underflow 
actually 
occurs when the biased exponent 
is less than129. 
Although 
the floating 
point operand 
is too small to be represented, 
its truncated 
fractional 
value is equal to an integer 
zero. The result output 
on the sum bus is therefore 
a integer 
zero, and no underflow 
status 
is 
set. 


Overflow 
occurs 
when the biased 
exponent 
is larger than 160 decimal 
(except 
for the case 
where 
the exponent 
is 161 decimal, 
the sign negative, 
and the explicit 
mantissa 
is all zero). 
Therefore, 
the floating-point 
operand 
is too large to be represented 
in integer format. The result 
output 
on the sum bus is the 2s complement 
value of the mantissa 
unshifted. 


INSTRUCTION: FLOAT TO FRACTION CONVERSION OPCODE : AI = 6 hex 
DESCRIPTION: 
Converts 
a floating-point 
internal 
F format 
number 
on the RAA port to a signed 
integer 
with 
radix point to the right of the MSB and with no rounding. 
The operand 
on the B port is ignored 
by this instruction. 
The resulting 
fraction 
has the eight exponent 
bits set to O. 


MNEMONIC: 
F2FR 
STATUS 


N: 
Z: 
COUT/U: 


N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
O. Underflow 
occurs when the biased 
exponent 
is less than 97 decimal. 
Although 
the floating 
point operand 
is too small to be represented, 
its truncated 
fractional 
value is equal to a fractional 
zero (0.0 ...0). The result output on the sum bus is therfore 
an integer 
zero, and no underflow 
status 
is set. 


OVER: 
Overflow 
occurs 
when the biased 
exponent 
is larger than 128 decimal 
(except 
for the case 
where 
the exponent 
is 129 decimal, 
the sign negative, 
and the explicit 
mantissa 
is all zero). 


Therefore, 
the floating-point 
operand 
is too large to be represented 
in signed fractional 
format. 


The result output 
on the SUM bus is the 2s complement 
value of the mantissa 
unshifted. 
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INSTRUCTION: 
INTEGER 
TO EXPONENT 
CONVERSION 
OPCODE 
: AI = 7 hex 
DESCRIPTION: 
Takes the least significant 
8 bits of operand 
RAA and inserts 
them into the exponent 
field of 
operand 
RAB, thereby 
replacing 
the operand 
RAB'S 
exponent. 
The mantissa 
and sign bit of operand 
RAB are 
left unaltered. 
MNEMONIC: 12E 
STATUS 
N: 
Sign of the result. 


z: 
Set when the result is zero. 


COUT/U: 
0 
OVER: 
0 
STEX: 


EXAMPLE: 
RAA = H'OOOFED7B98' 
RAB = H'83800154A7' 
Result = H'98800154A7' 
Status = B'1000' 


RAA = H'00AG701234' 
RAB = H'FF800154A7' 
Result = H'34800154A7' 
Status = B'1 000' 


INSTRUCTION: 
A + B + CIN OPCODE 
: AI = 8 hex 
DESCRIPTION: 
Integer 
addition 
of operand 
RAA to RAB with a carry-in 
from GIN pin. 
MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 


Z: 
Set normally 
as required. 


COUT/U: 
Set normally 
as required. 


OVER: 
Set normally 
as required. 


INSTRUCTION: 
A + B + CFF OPCODE 
: AI = 9 hex 
DESCRIPTION: 
Integer add of RAA to RAB with a carry-in 
from carry flip-flop. 
The carry flip-flop 
is enabled 
for 
each integer 
arithmetic 
operation. 


MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 
Z: 
Set normally 
as required. 


COUT/U: 
Set normally 
as required. 
OVER: 
Set normally 
as required. 


INSTRUCTION: 
A + B + CIN OPCODE 
: AI = A hex 
DESCRIPTION: 
Integer 
add of RAA to NOT RAB with a carry-in 
from GIN pin. 
MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 
Z: 
Set normally 
as required. 


COUT/U: 
Set normally 
as required. 


OVER: 
Set normally 
as required. 


STEX: 
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PROGRAMMING 
INFORMATION 


INSTRUCTION: 
A + B + CFF OPCODE 
: AI = B hex 
DESCRIPTION: 
Integer add of RAA to NOT RAB with a carry-in from carry flip-flop. 
The carry flip-flop 
is enabled 
for every integer 
arithmetic 
operation. 


MNEMONIC: 
STATUS 
N: 
z: 
COUT/U: 


OVER: 


Set normally 
as required. 
Set normally 
as required. 
Set normally 
as required. 
Set normally 
as required. 


INSTRUCTION: 
A + B + CIN OPCODE 
: AI = C hex 


DESCRIPTION: 
Integer 
add of NOT RAA to RAB with a carry-in 
from elN 
pin. 
MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 
z: 
Set normally 
as required. 


COUT/U: 
Set normally 
as required. 
OVER: 
Set normally 
as required. 


INSTRUCTION: 
PASS A / B on CIN OPCODE 
: AI = 0 hex 


DESCRIPTION: 
This instruction 
passes 
the A or B operand 
depending 
on the value of the elN 
bit. If elN = 0 
then A is passed, 
if elN = 1 then B is passed. 
This instruction 
passes 
both integer and floating-point 
operands. 


MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 
Z: 
Set normally 
as required 
for integers 
(see Note 2) 


COUT/U: 
o. 


OVER: 
O. 
CAUTION: 
THE STATUS REGISTER 
MUST BE ENABLED 
DURING THIS INSTRUCTION. 
This is necessary 
to avoid unstable conditions 
resulting from the status being fed back into the ALU control. 


NOTE 2: 
If the operands being passed are floating-point, the pass operation will work correctly. However, the ZERO status may be incorrect. This 
instruction is part of the ALU integer instruction set and therefore for all operands being passed the ZERO status is determined as if the 
operand were an integer. For more details see floating-point 
status under status section. 


INSTRUCTION: 
MIN / MAX on CIN OPCODE 
: AI = E hex 


DESCRIPTION: 
This instruction 
passes the A or B operand 
depending 
on the registered 
ALU status bits of the 
previous 
operation. 
When elN = 0, the minimum 
value is selected; 
when elN = 1, the maximum 
value is selected. 
The comparison 
between 
the value of A and B is based 
on the ALU status 
bits N and OVER 
(see Table 22). 


MNEMONIC: 
STATUS 
N: 
Set normally 
as required. 
Z: 
Set normally 
as required 
for integers. 
(see Note 3) 
COUT/V: 
0 
OVER: 
0 
CAUTION: 
STATUS REGISTER MUST BE ENABLED DURING THIS INSTRUCTION. 
This is necessary to avoid unstable conditions resulting 


from the status being fed back into the ALU control in this instruction. 
NOTE 3: 
If the operands being passed are floating-point, 
the pass operation will work correctly. However, the ZERO status may be incorrect. This 


instruction is part of the ALU integer instruction set and therefore for all operands being passed the ZERO status is determined as if the 
operand were an integer. For more details see floating-point 
status under status section. 
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INSTRUCTION: A + CIN OPCODE : AI = F hex 
DESCRIPTION: 
Addition of RM to zero with a carry-in from CIN pin. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
Carry out of the MSB. 
OVER: 
Set normally as required. 


INSTRUCTION: A + CIN OPCODE: AI = 10 hex 
DESCRIPTION: 
Integer addition of NOT RAA to zero with a carry-in from CIN pin. When CIN = 1, then this is 
the 2s complement of RM which is equivalent to the (2-A) operation in the Newton-Raphson divide algorithm. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
Carry out of MSB. 


OVER: 
Set normally as required. 


INSTRUCTION: A + CFF OPCODE : AI = 11 hex 
DESCRIPTION: 
Integer addition of NOT RAA to zero with a carry-in from carry flip-flop. As long as CFF = 1, 


then this isthe 2s complement of RM which is equivalent to the (2 - A) operation in the Newton-Raphson divide 
algorithm, otherwise this is a NEGATE operation. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
Carry out of MSB. 


OVER: 
Set normally as required. 


INSTRUCTION: A AND B OPCODE : AI = 12 hex 
DESCRIPTION: 
Logical AND of RAA and RAB. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
o. 
OVER: 
O. 


INSTRUCTION: A AND B OPCODE : AI = 13 hex 
DESCRIPTION: 
Logical AND of NOT RAA and RAB. 


MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 
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INSTRUCTION: 
A XOR 
B OPCODE 
: AI = 14 hex 
DESCRIPTION: 
Exclusive OR of RM 
and RAB. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 


COUT/U: 
0 
OVER: 
0 


INSTRUCTION: 
A OR B OPCODE 
: AI = 15 hex 
DESCRIPTION: 
Logical OR of RM 
and RAB. 


MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 


COUT/U: 
0 
OVER: 
0 


INSTRUCTION: 
A NAND 
B OPCODE: 
AI = 16 hex 
DESCRIPTION: 
Logical NAND of RM 
and RAB. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 


COUT/U: 
0 
OVER: 
0 


INSTRUCTION: 
A NOR B OPCODE: 
AI = 17 hex 
DESCRIPTION: 
Logical NOR of RM 
and RAB. 
MNEMONIC: 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 


INSTRUCTION: 
SHIFT 
RIGHT 
LOGICAL 
A OPCODE 
: AI = 18 hex 
DESCRIPTION: Performs a barrel shift right on operand RM. 
The shift count comes from the least significant 
5 bits of the RAB operand. The more significant bits of operand RAB are not used. The shift quantity is always 
interpreted as a positive value. This instruction also clears the link flip-flop. Zeroes are filled in from the left. 
MNEMONIC: SRL 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 


EXAMPLE: 
RM 
= H'OOA170258D' RAB = H'OOAD9754CO' 
Result = H'OOA170258D' Status = B'1000' 
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INSTRUCTION: 
SHIFT 
RIGHT 
ARITHMETIC 
A OPCODE: 
AI = 19 hex 


DESCRIPTION: 
Performs 
a barrel shift right on operand 
RM, 
filling with the sign bit. The shift count 
comes 


from the least significant 
5 bits of the RAB operand. 
The shift quantity 
is always 
interpreted 
as a positive 
value. 


This instruction 
also clears the link flip-flop 
upon every 
execution. 
The result is sign filled on the left. 


MNEMONIC: 
SRA 


STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 


OVER: 
0 


INSTRUCTION: 
ROTATE 
RIGHT 
B OPCODE: 
AI = 1A hex 


DESCRIPTION: 
Rotates 
operand 
RAB to the right by one bit position. 
The most significant 
bit is filled with the 


contents 
of the link flip-flop, 
and the least significant 
bit lost during 
the rotation 
is stored 
in the link flip-flap 
on 


the next rising edge of the clock. The operand 
on the RM 
port is ignored. 


MNEMONIC: 
ROR 


STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 


OVER: 
0 


EXAMPLE: 
RM 
= H')()()()()()()()' 
RAB = H'00A170258D' 
Link Flip-Flop 
= 0 


Result = H'0050B812C6' 
Status = B'OOOO'. Link Flip-Flop 
= 1 


RM 
= H')()()()()()()()' 
RAB = H'0050B812C6' 
Link Flip-Flop 
= 1 


Result = H'00A85C0963' 
Status = B'1000' 
Link Filp-Flop 
= 0 


RM 
= H')()()()()()()()' 
RAB = H'00A85C0963' 
Link Flip-Flop 
= 0 


Result = H'00542E04B1' 
Status = B'OOOO' Link Flip-Flop 
= 1 


INSTRUCTION: 
ROTATE 
LEFT 
B OPCODE 
: AI = 1B hex 


DESCRIPTION: 
Rotates 
operand 
RAB to the left one bit position. 
The least 
significant 
bit is filled 
with the 


contents 
of the link flip-flop, 
and the most significant 
bit lost during 
the rotation 
is stored 
in the link flip-flop 
on 


the very next rising edge of the clock. The operand 
on the RM 
port is ignored 
by this instruction. 
MNEMONIC: 
ROL 


STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 


OVER: 
0 


EXAMPLE: 
RM 
= H')()()()()()()()' 
RAB = H'00542E04B1' 
Link Flip-Flop 
= 1 


Result = H'00A85C0963' 
Status = B'1000' 
Link Flip-Flop 
= 0 


RM 
= H'xxxxxxxxxx' 
RAB = H'00A85C0963' 
Link Flip-Flop 
= 0 


Result = H'0050B812C6' 
Status = B'OOOO' Link Flip-Flop 
= 1 


RM 
= H'xxxxxxxxxx' 
RAB = H'0050B812C6' 
Link Flip-Flop 
= 1 


Result = H'OOA170258D' 
Status 
= B'1000' 
Link Flip-Flop 
= 0 
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INSTRUCTION: SHIFT LEFT LOGICAL A OPCODE : AI = 1C hex 
DESCRIPTION: 
Performs 
a barrel shift left on operand 
RM. 
The shift count comes from the least significant 
5 bits of the RAB operand. 
The shift quantity 
is always 
interpreted 
as a positive value. This instruction 
also sets 
the link flip-flop 
upon every execution. 
Zeroes 
are filled from the right. 


MNEMONIC: SLL 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 


COUT/U: 
0 
OVER: 
0 


INSTRUCTION: EXPONENT TO INTEGER CONVERSION OPCODE : AI = 1D hex 
DESCRIPTION: Takes the eight-bit 
exponent 
field of operand 
A and inserts them into the least significant 
eight 
bits of the output. 
All other bits are zeroed. 
The operand 
on the RAB input is ignored 
by this instruction. 


MNEMONIC: E21 
STATUS 
N: 
0 
Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 


EXAMPLE: 
RM 
= H'84005D7900' 
RAB = H'xxxxxxxxxx' 
Result = H'0000000084' 
Status = B'OOOO' 


RM 
= H'C9F876CCOO' 
RAB = H'xxxxxxxxxx' 


Result = H'00000000C9' 
Status = B'OOOO 


INSTRUCTION: INTEGER TO FLOAT CONVERSION OPCODE : AI = 1E hex 
DESCRIPTION: 
Converts 
an integer 
to a floating-point 
number. 
The operand 
on the RAB input is ignored 
by 
this instruction. 
MNEMONIC: 12F 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 


INSTRUCTION: FRACTIONAL TO FLOAT CONVERSION OPCODE : AI = 1F hex 
DESCRIPTION: 
Converts 
an integer 
with radix point to the right of the MSB to a floating-point 
number. 
The 
operand 
on the RAB input is ignored 
by this instruction. 
MNEMONIC: FR2F 
STATUS 
N: 
Sign of the result. 


Z: 
Set when the result is zero. 
COUT/U: 
0 
OVER: 
0 
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Design Support 


Design Support for TI's SN74ACT8800 
Family 


Tl's '8800 32-bit processor family is supported by a variety of tools developed to aid 
in design evaluation and verification. These tools will streamline all stages of the design 
process, from assessing the operation and performance of an individual device to 
evaluating a total system application. The tools include functional models, behavioral 
models, microcode development software, as well as the expertise of Tl's Datapath 
VLSI Products Systems Engineering group. 


Functional I;valuation Models Aid in Device Evaluation 


Many design decisions can easily be made and evaluated before hardware or board 
prototypes are needed, using functional evaluation software models. The result is 
shortened design cycles and lower design costs. 


Texas Instruments offers functional evaluation models for many of the devices in the 
'8800 family. These models are written in Microsoft C® and can be used in stand- 
alone mode or as callable functions. 


These models are designed to provide insight into the operation of the devices by 
allowing the designer to write microcode and run it through the model. This allows 
the designer to select the device that best executes a specific application and provides 
a head start in evaluating programming performance. 


The models correctly represent device timing in clock cycles, measured from the input 
of control and data to the output of results and status. Hence, initial performance 
estimates for a particular design can be made by relating the number of clock cycles 
required for an operation to the typical ac timing data for the device. 


Behavioral Simulation Models Simplify System Debugging 


System simulation with behavioral models can further shorten design time and ease 
design effort. The behavioral simulation models that support Tl's '8800 chip set have 
the timing-control and error-handling capability to perform thorough PCBand system 
simulation. These models decreasethe time spent in debugging and reducethe number 
of required prototype runs. 


Users of system simulation models report a reduction by more than half in the number 
of prototype runs typically requiredto produce the highest-quality system. This savings 
in time reduces costs and gets the product to market as much as several months earlier 
than could be done using traditional methods. 


Behavioral models for TI's '8800 family are written at the functional behavioral level 
and, therefore, 
are faster and easier to use and take up less disk space than some 
other types of simulation models. This higher efficiency 
means a simulation run can 
include more IC models and yet require less CPU time than an equivalent simulation 
using other types of models. 


These behavioral simulation models also provide explicit error messages that can help 
in the debugging process. For example, if a design violates a device set-up time, the 
model explains, via an error message, what type of violation occurred, at what point 
it occurred in the simulation run, and specifically which part's set-up time was violated. 
Then, the model continues on with the run as if no violation occurred, saving time 
rather than crashing the run at every error. 


In other words, an expert debugger is built right into the simulation. 


The models are available with commercial and military timing and interact with a variety 
of simulators. 


Behavioral Models for TI's '8800 
Family are Easily Obtained 


Texas Instruments has been working closely with both Quadtree Software Corporation 
and Logic Automation Incorporated to produce software behavioral simulation models 
of many of its VLSI devices. Since accuracy is key to solving design problems, we've 
provided Quadtree and Logic Automation 
with test patterns for most of our devices 
to ensure each model passes the same set of test vectors as does the actual silicon 
device. 


Quadtree offers a library of Designer's Choice'" full-functional 
behavioral models of 
Texas Instruments 
'8800 
32-bit processor building block devices. 


Logic Automation 
Smartmodel'" library contains many Texas Instruments products, 
including devices from the '8800 
chip set. 


These companies 
may be contacted 
directly 
at the 
addresses below. 
General 
information about behavioral model support for the '8800 family may be obtained by 
calling Texas Instruments 
at (214) 997-5402. 


LOGIC AUTOMATION 
INCORPORATED 
P.O. Box 310 
Beaverton, OR 97075 
(503) 690-6900 


QUADTREE SOFTWARE CORPORATION 
1170 Route 22 East 
Bridgewater, 
NJ 08807 
(201) 725-2272 


Quadtree and Designer's Choice are trademarks 
of Quadtree Software 
Corporation 
Logic Automation 
and Smartmodel are trademarks of Logic Automation 
Incorporated 


'8800 
SOB Design Kit 


TI offers an '8800 
Software Development Board (SOB) Design Kit as an evaluation 
and training tool. The '8800 SOB kit uses a range of software development tools to 
allow users to evaluate performance and write microprograms for several of the '8800 
building blocks. Using the SOB,microcode can be developed earlier in a system's design 
cycle so that code development parallels, rather than follows, 
prototype design. 


The '8800 SOBDesign Kit consists of a combination of specially developed hardware, 
software, 
and documentation including: 


• 
The '8800 
Software Development Board Assembly 


• 
Floppy disk with MS-DOS'· software tools written in Microsoft C, several 
example microprograms, and demo programs. Source code is included. 


• 
Microcode definition files for use with HILEVEL,STEPEngineering, and Texas 
Instruments microcode development tools. 


Built on a PC/AT card occupying a single slot, the '8800 SOBcontains an 'ACT8818A 
microsequencer, 'ACT8832A registered ALU, and an 'ACT884 7 floating-point/integer 
unit, along with 32 K by 128 bits of microcode memory, and 32 K by 32 bits of local 
data memory. A block diagram of the '8800 SOB is detailed in Figure 8-1. The board 
operates under an MS-DOS environment. 


The SOB Design Kit complements other '8800 
family development tools such as 
functional evaluation and behavioral simulation models. It actually provides the next 
step beyond simulators. System code can be executed in a realtime environment that 
includes conditional branching, on-board data memory, and single-step/breakpoint 
facilities. 


For additional 
technical 
information, 
contact 
Datapath 
VLSI Products 
Systems 
Engineering at (214) 997-3970. 
For ordering information, please call your local field 
sales representative. 
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Program Code Generation Using the TI Meta Assembler 


The TI Meta Assembler (TIM) provides the means to create object microcode files and 
to support 
listings 
for programs that 
execute 
in architectures 
without 
standard 
instruction sets. The end-product of TIM is an absolute object code module in suitable 
format 
for downloading 
to PROM programmers 
or to the emulator 
memories of 
development systems. TIM is fully compatible with some other assemblers as well. 


Systems Expertise is a Phone Call Away 


Texas Instruments Datapath VLSI Products Systems Engineering group is available 
to help designers analyze TI's high-performance 
VLSI products, such as the '8800 
32-bit processor family. The group works directly with designers to provide ready 
answers to device--related questions 
and also prepares a variety 
of applications 
documentation. 


Design Support 
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GB pin grid array ceramic package 
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(See Note A) 


ALL 
POSSIBLE 
PIN LOCATIONS 
ARE 
SHOWN. 
SEE APPLICABLE 
PRODUCT 
DATA 
SHEETS 
FOR 
ACTUAL 
PIN 
LOCATIONS 
USED. 


NOTES: A. Pin tips are located with 0,13 10.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 
B. Dimensions do not include solder finish. 
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NOTES: A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 
B. Dimensions do not include solder finish. 
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ALL POSSIBLE PIN LOCA nONS 
ARE 
SHOWN. 
SEE APPLICABLE PRODUCT 


DATA 
SHEETS 
FOR 
ACTUAL 
PIN 


LOCATIONS USED. 


NOTES: A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 
B. Dimensions do not include solder finish. 
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NOTES: A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 
B. Dimensions do not include solder finish. 
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All 
POSSIBLE PIN lOCATIONS 
ARE 
SHOWN. SEE APPLICABLE PRODUCT 
DA TA 
SHEETS 
FOR ACTUAL 
PIN 
lOCATIONS 
USED. 


NOTES: 
A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 


B. Dimensions do not include solder finish. 
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NOTES: A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
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B. Dimensions do not include solder finish. 
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NOTES: 
A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 


B. Dimensions do not include solder finish. 
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NOTES: A. Pin tips are located with 0,13 (0.005) radius of true position relative to each other at 
maximum material condition and within 0,46 (0.018) radius relative to the center of 
the ceramic. 
B. Dimensions do not include solder finish. 


